MoPO: Incorporating Motion Prior for Occluded Human Mesh Recovery
Pith reviewed 2026-05-12 04:55 UTC · model grok-4.3
The pith
Motion sequences from prior poses can reliably complete occluded joints to recover accurate human meshes.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that incorporating motion prior for occluded human mesh recovery, through a motion de-occlusion module that detects joint visibility and predicts plausible positions for occluded parts from history poses, combined with motion-aware fusion and refinement using inverse kinematics, produces more accurate and temporally consistent human meshes than methods relying only on occluded image features.
What carries the argument
The motion de-occlusion module, which uses a spatial-temporal occlusion detector followed by a lightweight motion predictor to complete occluded joint positions based on prior pose sequences, supplying occlusion-free prior for shape and pose estimation.
If this is right
- Higher accuracy for estimating positions and shapes of occluded body parts in single-image human mesh recovery.
- Reduced motion jitter and improved temporal consistency in video-based recovery results.
- State-of-the-art performance on both occlusion-specific and standard human mesh benchmarks.
- Better robustness in real-world scenes with frequent partial occlusions such as crowds or objects blocking the view.
Where Pith is reading between the lines
- This approach could extend to other vision tasks with partial observations by treating motion or temporal context as a completion prior.
- The lightweight design of the predictor suggests it may integrate into existing mesh recovery pipelines with low additional computational cost.
- More advanced motion predictors could be swapped in to test further gains in prediction accuracy for occluded regions.
- The method points toward hybrid systems that combine image features with learned motion dynamics for handling sensor or viewpoint limitations.
Load-bearing premise
That pose sequences inherently contain reliable motion prior for estimating occluded body parts and that the lightweight motion predictor can accurately complete them without introducing errors that propagate to the final mesh and pose estimates.
What would settle it
A controlled test on sequences with known ground-truth occluded joints where the motion predictor's completed positions are compared to actual values, or where disabling the motion completion step causes performance to fall below image-only baselines on occluded benchmarks.
Figures
read the original abstract
Although recent studies have made remarkable progress in human mesh recovery, they still exhibit limited robustness to occlusions and often produce inaccurate poses and severe motion jitter due to the insufficient spatial features for occluded body parts. Inspired by the rapid advancements in human motion prediction, we discover that compared to occluded image features, pose sequence inherently contains reliable motion prior for estimating occluded body parts. In this paper, we incorporate Motion Prior for Occluded human mesh recovery, called MoPO. Our MoPO mainly consists of two components: 1) The motion de-occlusion module, where we propose a spatial-temporal occlusion detector to detect joint visibility, and then we propose a lightweight motion predictor to complete the occluded body parts by predicting the most plausible joint positions based on history poses. 2) The motion-aware fusion and refinement module, which fuses the completed joint sequence with image features to estimate human shape and initial human pose. Moreover, the completed joint sequence is further used to refine the final human pose through inverse kinematics, which provides the occlusion-free motion prior for regressing human poses. Extensive experiments demonstrate that MoPO achieves state-of-the-art performance on both occlusion-specific and standard benchmarks, significantly enhancing the accuracy and temporal consistency of occluded human mesh recovery. Our code and demo can be found in the supplementary material.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes MoPO for occluded human mesh recovery. It uses a motion de-occlusion module consisting of a spatial-temporal occlusion detector to identify invisible joints and a lightweight motion predictor that completes occluded joints from history pose sequences. These completed poses are then fused with image features in a motion-aware fusion and refinement module to estimate shape and initial pose, with inverse kinematics applied for final pose refinement using the occlusion-free motion prior. The authors claim this yields state-of-the-art results on both occlusion-specific and standard benchmarks while improving accuracy and temporal consistency over prior methods.
Significance. If the central claims hold after validation, the work would be significant for computer vision applications involving video or crowded scenes, as it offers a principled way to leverage temporal motion priors to compensate for missing spatial features in occluded regions. The lightweight predictor design could also support efficient deployment, and the overall approach builds on recent motion prediction advances in a way that addresses a persistent weakness in human mesh recovery pipelines.
major comments (3)
- [Abstract and §4] Abstract and §4 (Experiments): The claim that the lightweight motion predictor produces accurate completions that improve (rather than degrade) final mesh and pose estimates is load-bearing for the central thesis, yet no standalone quantitative evaluation of the predictor is reported (e.g., MPJPE or PCK on held-out occluded joints) and no ablation isolates error propagation through the fusion and IK stages. Without these, it is impossible to confirm that pose-sequence priors are reliably superior to occluded image features.
- [§3.1] §3.1 (Motion de-occlusion module): The spatial-temporal occlusion detector and lightweight predictor are introduced at a high level, but the manuscript provides no derivation or training objective for the predictor, no analysis of its behavior under long occlusions or ambiguous dynamics, and no comparison against stronger motion-prediction baselines. This leaves open the risk that hallucinated joint positions directly corrupt the subsequent motion-aware fusion.
- [§3.2] §3.2 (Motion-aware fusion and refinement): The fusion of completed joint sequences with image features and the inverse-kinematics refinement step are described without equations or pseudocode showing how the motion prior is injected or how conflicts between image evidence and predicted joints are resolved. This detail is required to assess whether the claimed temporal consistency gains are reproducible and robust.
minor comments (2)
- The abstract states that code and demo are in the supplementary material, but the main text should include at least a brief implementation paragraph (network architecture, training schedule, loss weights) to aid reviewers and readers.
- Notation for joint visibility and completed pose sequences is introduced without a clear table or diagram; adding one would improve readability of the method.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed feedback. The comments highlight important areas where additional evaluation and technical detail will strengthen the manuscript. We address each major comment below and have revised the paper accordingly to improve clarity, reproducibility, and support for the central claims.
read point-by-point responses
-
Referee: [Abstract and §4] Abstract and §4 (Experiments): The claim that the lightweight motion predictor produces accurate completions that improve (rather than degrade) final mesh and pose estimates is load-bearing for the central thesis, yet no standalone quantitative evaluation of the predictor is reported (e.g., MPJPE or PCK on held-out occluded joints) and no ablation isolates error propagation through the fusion and IK stages. Without these, it is impossible to confirm that pose-sequence priors are reliably superior to occluded image features.
Authors: We agree that a direct, standalone evaluation of the motion predictor is necessary to substantiate the claim that motion priors improve rather than degrade estimates. In the revised manuscript we have added a new subsection in §4 with quantitative results on the predictor alone: MPJPE and PCK computed on held-out occluded joints using the occlusion detector outputs, together with an ablation that measures error propagation through the motion-aware fusion and IK stages. These results show consistent improvement over image-only baselines and confirm that the completed poses are beneficial. revision: yes
-
Referee: [§3.1] §3.1 (Motion de-occlusion module): The spatial-temporal occlusion detector and lightweight predictor are introduced at a high level, but the manuscript provides no derivation or training objective for the predictor, no analysis of its behavior under long occlusions or ambiguous dynamics, and no comparison against stronger motion-prediction baselines. This leaves open the risk that hallucinated joint positions directly corrupt the subsequent motion-aware fusion.
Authors: We have expanded §3.1 with the explicit training objective (L2 joint loss plus temporal smoothness regularizer) and the network architecture details. We also added an analysis subsection in the experiments that reports predictor accuracy as a function of occlusion duration and motion ambiguity, plus direct comparisons against stronger motion-prediction baselines (e.g., recent transformer-based predictors). These additions demonstrate that the lightweight predictor remains competitive while preserving efficiency. revision: yes
-
Referee: [§3.2] §3.2 (Motion-aware fusion and refinement): The fusion of completed joint sequences with image features and the inverse-kinematics refinement step are described without equations or pseudocode showing how the motion prior is injected or how conflicts between image evidence and predicted joints are resolved. This detail is required to assess whether the claimed temporal consistency gains are reproducible and robust.
Authors: We have revised §3.2 to include the full mathematical formulation of the motion-aware fusion (feature weighting by occlusion scores) and the IK refinement objective. We also provide pseudocode for the overall refinement pipeline that explicitly shows how image evidence and motion priors are combined and how conflicts are resolved via weighted least-squares IK. These additions make the temporal consistency improvements reproducible. revision: yes
Circularity Check
No circularity: method is a trained architecture evaluated on external benchmarks
full rationale
The paper introduces MoPO as a two-module pipeline (motion de-occlusion via detector + lightweight predictor, followed by motion-aware fusion, shape regression, and IK refinement) whose motion prior is explicitly drawn from prior external work on human motion prediction. The central claims rest on training the components end-to-end and reporting performance on occlusion-specific and standard benchmarks; no equations, parameters, or results are defined in terms of the target outputs, no fitted subset is relabeled as a prediction, and no load-bearing premise reduces to a self-citation chain. The derivation chain is therefore self-contained against external data and evaluation.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Pose sequence inherently contains reliable motion prior for estimating occluded body parts
Reference graph
Works this paper leans on
-
[1]
Y . Tian, H. Zhang, Y . Liu, L. Wang, Recovering 3D human mesh from monocular images: A survey, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 45 (12) (2023) 15406–15425
work page 2023
-
[2]
W. Li, M. Liu, H. Liu, T. Guo, T. Wang, H. Tang, N. Sebe, Graphmlp: A graph mlp-like architecture for 3d human pose estimation, Pattern Recognition 158 (2025) 110925
work page 2025
- [3]
-
[4]
M. Kocabas, N. Athanasiou, M. J. Black, Vibe: Video inference for human body pose and shape estimation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 5253–5263
work page 2020
-
[5]
M. Kocabas, C.-H. P. Huang, O. Hilliges, M. J. Black, Pare: Part attention regressor for 3D human body estimation, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 11127–11137
work page 2021
-
[6]
J. Li, Z. Yang, X. Wang, J. Ma, C. Zhou, Y . Yang, Jotr: 3D joint contrastive learning with transformers for occluded human mesh recovery, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 9110–9121
work page 2023
-
[7]
Y . Zhu, A. Li, Y . Tang, W. Zhao, J. Zhou, J. Lu, Dpmesh: Exploiting diffusion prior for occluded human mesh recovery, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 1101–1110
work page 2024
- [8]
-
[9]
H. Choi, G. Moon, K. M. Lee, Pose2mesh: Graph convolutional network for 3D human pose and mesh recovery from a 2D human pose, in: Proceedings of the European Conference on Computer Vision (ECCV), 2020, pp. 769–787
work page 2020
-
[10]
Y . You, H. Liu, T. Wang, W. Li, R. Ding, X. Li, Co-evolution of pose and mesh for 3D human body estimation from video, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 14963–14973
work page 2023
-
[11]
Y . Sun, Q. Bao, W. Liu, Y . Fu, M. J. Black, T. Mei, Monocular, one-stage, re- gression of multiple 3D people, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 11179–11188
work page 2021
-
[12]
H. Choi, G. Moon, J. Park, K. M. Lee, Learning to estimate robust 3D human mesh from in-the-wild crowded scenes, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 1475–1484
work page 2022
-
[13]
C.-H. Yao, J. Yang, D. Ceylan, Y . Zhou, Y . Zhou, M.-H. Yang, Learning visi- 28 bility for robust dense human body estimation, in: Proceedings of the European Conference on Computer Vision (ECCV), Springer, 2022, pp. 412–428
work page 2022
-
[14]
C. Yang, K. Kong, S. Min, D. Wee, H.-D. Jang, G. Cha, S. Kang, Sefd: learning to distill complex pose and occlusion, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 14941–14952
work page 2023
-
[15]
M.-G. Gwon, G.-M. Um, W.-S. Cheong, W. Kim, Instance-aware contrastive learn- ing for occluded human mesh reconstruction, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 10553–10562
work page 2024
-
[16]
Y . Sun, W. Liu, Q. Bao, Y . Fu, T. Mei, M. J. Black, Putting people in their place: Monocular regression of 3D people in depth, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 13243–13252
work page 2022
-
[17]
S. Bian, J. Li, J. Tang, C. Lu, Shapeboost: Boosting human shape estimation with part-based parameterization and clothing-preserving augmentation, in: Proceed- ings of the AAAI Conference on Artificial Intelligence (AAAI), V ol. 38, 2024, pp. 828–836
work page 2024
- [18]
-
[19]
K. Lyu, H. Chen, Z. Liu, B. Zhang, R. Wang, 3D human motion prediction: A survey, Neurocomputing 489 (2022) 345–365
work page 2022
-
[20]
R. Rombach, A. Blattmann, D. Lorenz, P. Esser, B. Ommer, High-resolution image synthesis with latent diffusion models, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 10684–10695
work page 2022
-
[21]
Y . Xu, J. P. Zhang, Q. Zhang, D. Tao, Vitpose: Simple vision transformer baselines for human pose estimation, Advances in Neural Information Processing Systems 29 (NeurIPS) 35 (2022) 38571–38584
work page 2022
- [22]
-
[23]
S. Goel, G. Pavlakos, J. Rajasegaran, A. Kanazawa, J. Malik, Humans in 4d: Reconstructing and tracking humans with transformers, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 14783– 14794
work page 2023
-
[24]
Z. Li, J. Liu, Z. Zhang, S. Xu, Y . Yan, Cliff: Carrying location information in full frames into human pose and shape estimation, in: Proceedings of the European Conference on Computer Vision (ECCV), 2022, pp. 590–606
work page 2022
-
[25]
N. Kolotouros, G. Pavlakos, M. J. Black, K. Daniilidis, Learning to reconstruct 3D human pose and shape via model-fitting in the loop, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 2252–2261
work page 2019
-
[26]
Y . Zhou, C. Barnes, J. Lu, J. Yang, H. Li, On the continuity of rotation repre- sentations in neural networks, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 5745–5753
work page 2019
-
[27]
T. V on Marcard, R. Henschel, M. J. Black, B. Rosenhahn, G. Pons-Moll, Recov- ering accurate 3D human pose in the wild using imus and a moving camera, in: Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 601–617
work page 2018
- [28]
-
[29]
H. Joo, H. Liu, L. Tan, L. Gui, B. Nabbe, I. Matthews, T. Kanade, S. Nobuhara, Y . Sheikh, Panoptic studio: A massively multiview system for social motion capture, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2015, pp. 3334–3342. 30
work page 2015
-
[30]
M. Liu, J. Liu, Adaptive learning from noisy estimated depth maps benefits monocular rgb-based 3d human pose estimation, Pattern Recognition 179 (2026) 113530
work page 2026
-
[31]
K. Shetty, A. Birkhold, S. Jaganathan, N. Strobel, M. Kowarschik, A. Maier, B. Egger, Pliks: A pseudo-linear inverse kinematic solver for 3D human body estimation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 574–584
work page 2023
- [32]
-
[33]
J. Li, S. Bian, Q. Liu, J. Tang, F. Wang, C. Lu, Niki: Neural inverse kinematics with invertible neural networks for 3D human pose and shape estimation, in: Proceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 12933–12942
work page 2023
-
[34]
J. Li, C. Xu, Z. Chen, S. Bian, L. Yang, C. Lu, Hybrik: A hybrid analytical- neural inverse kinematics solution for 3D human pose and shape estimation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 3383–3393
work page 2021
-
[35]
Z. Luo, S. A. Golestaneh, K. M. Kitani, 3D human motion estimation via motion compression and refinement, in: Proceedings of the Asian Conference on Computer Vision (ACCV), Springer, 2020, pp. 324–340
work page 2020
-
[36]
H. Choi, G. Moon, J. Y . Chang, K. M. Lee, Beyond static features for tempo- rally consistent 3D human pose and shape from a video, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 1964–1973
work page 2021
-
[37]
Z. Wan, Z. Li, M. Tian, J. Liu, S. Yi, H. Li, Encoder-decoder with multi-level atten- tion for 3D human shape and pose estimation, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 13033–13042. 31
work page 2021
-
[38]
X. Shen, Z. Yang, X. Wang, J. Ma, C. Zhou, Y . Yang, Global-to-local modeling for video-based 3D human pose and shape estimation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 8887–8896
work page 2023
-
[39]
P. Wu, X. Lu, J. Shen, Y . Yin, Clip fusion with bi-level optimization for human mesh reconstruction from monocular videos, in: Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023, pp. 105–115
work page 2023
-
[40]
M. Lee, H. b. Lee, B. Kim, S. Kim, Unspat: Uncertainty-guided spatiotemporal transformer for 3D human pose and shape estimation on videos, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (W ACV), 2024, pp. 3004–3013
work page 2024
-
[41]
T. Tang, H. Liu, Y . You, T. Wang, W. Li, Arts: Semi-analytical regressor using disentangled skeletal representations for human mesh recovery from videos, in: Proceedings of the 32th ACM International Conference on Multimedia (ACM MM), 2024, pp. 1514–1523
work page 2024
- [42]
-
[43]
R. Khirodkar, S. Tripathi, K. Kitani, Occluded human mesh recovery, in: Proceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 1715–1725
work page 2022
- [44]
-
[45]
Y . Yuan, U. Iqbal, P. Molchanov, K. Kitani, J. Kautz, Glamr: Global occlusion- aware human mesh recovery with dynamic cameras, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 32 2022, pp. 11038–11049
work page 2022
- [46]
- [47]
-
[48]
W. Yang, Z.-H. Jiang, S. Zhao, S. K. Zhou, Postometro: Pose token enhanced mesh transformer for robust 3D human mesh recovery, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025, pp. 4746–4756
work page 2025
-
[49]
W. Guo, Y . Du, X. Shen, V . Lepetit, X. Alameda-Pineda, F. Moreno-Noguer, Back to mlp: A simple baseline for human motion prediction, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 4809–4819
work page 2023
-
[50]
I. O. Tolstikhin, N. Houlsby, A. Kolesnikov, L. Beyer, X. Zhai, T. Unterthiner, J. Yung, A. Steiner, D. Keysers, J. Uszkoreit, et al., Mlp-mixer: An all-mlp archi- tecture for vision, Advances in Neural Information Processing Systems (NeurIPS) 34 (2021) 24261–24272
work page 2021
-
[51]
N. Mahmood, N. Ghorbani, N. F. Troje, G. Pons-Moll, M. J. Black, Amass: Archive of motion capture as surface shapes, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 5442–5451
work page 2019
-
[52]
A. Kanazawa, J. Y . Zhang, P. Felsen, J. Malik, Learning 3D human dynamics from video, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 5614–5623
work page 2019
-
[53]
S. Shin, J. Kim, E. Halilaj, M. J. Black, Wham: Reconstructing world-grounded humans with accurate 3D motion, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 2070–2080
work page 2024
-
[54]
C. Ionescu, D. Papava, V . Olaru, C. Sminchisescu, Human3.6m: Large scale datasets and predictive methods for 3D human sensing in natural environments, 33 IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 36 (7) (2013) 1325–1339
work page 2013
-
[55]
W. Li, M. Liu, H. Liu, P. Wang, J. Cai, N. Sebe, Hourglass tokenizer for efficient transformer-based 3D human pose estimation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
work page 2024
-
[56]
W. Zhu, X. Ma, Z. Liu, L. Liu, W. Wu, Y . Wang, Motionbert: A unified perspective on learning human motion representations, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 15085–15099
work page 2023
- [57]
- [58]
- [59]
- [60]
- [61]
-
[62]
M. Fieraru, M. Zanfir, T. Szente, E. Bazavan, V . Olaru, C. Sminchisescu, Remips: Physically consistent 3D reconstruction of multiple interacting people under weak supervision, Advances in Neural Information Processing Systems (NeurIPS) 34 (2021) 19385–19397
work page 2021
-
[63]
A. Kanazawa, M. J. Black, D. W. Jacobs, J. Malik, End-to-end recovery of human shape and pose, in: Proceedings of the IEEE/CVF Conference on Computer Vision 34 and Pattern Recognition (CVPR), 2018, pp. 7122–7131
work page 2018
-
[64]
E.-T. Lê, A. Kakolyris, P. Koutras, H. Tam, E. Skordos, G. Papandreou, R. A. Güler, I. Kokkinos, Meshpose: Unifying densepose and 3D body mesh reconstruction, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 2405–2414
work page 2024
-
[65]
S. K. Dwivedi, Y . Sun, P. Patel, Y . Feng, M. J. Black, Tokenhmr: Advancing human mesh recovery with a tokenized pose representation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 1323–1333
work page 2024
-
[66]
Y .-P. Song, X. Wu, Z. Yuan, J.-J. Qiao, Q. Peng, Posturehmr: Posture transforma- tion for 3d human mesh recovery, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 9732–9741
work page 2024
-
[67]
Y . Xu, X. Ma, J. Su, W. Zhu, Y . Qiao, Y . Wang, Scorehypo: Probabilistic human mesh estimation with hypothesis scoring, in: Proceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 979–989
work page 2024
-
[68]
A. Kanazawa, J. Y . Zhang, P. Felsen, J. Malik, Learning 3D human dynamics from video, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 5614–5623. 35
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.