Purify-then-Align: Towards Robust Human Sensing under Modality Missing with Knowledge Distillation from Noisy Multimodal Teacher
Pith reviewed 2026-05-10 19:22 UTC · model grok-4.3
The pith
PTA first purifies multimodal knowledge by down-weighting noisy modalities with meta-learning, then aligns representations via diffusion distillation to build robust single-modality human sensing models.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The PTA framework solves the causal dependency between the representation gap and contamination effect by first employing a meta-learning-driven weighting mechanism that dynamically learns to down-weight the influence of noisy, low-contributing modalities. Subsequently, it introduces a diffusion-based knowledge distillation paradigm in which an information-rich clean teacher, formed from this purified consensus, refines the features of each student modality. The ultimate payoff is the creation of exceptionally powerful single-modality encoders imbued with cross-modal knowledge.
What carries the argument
The Purify-then-Align strategy that uses meta-learning to create a purified multimodal teacher before diffusion-based distillation transfers its knowledge to single-modality students.
If this is right
- Single-modality encoders acquire cross-modal knowledge and perform better when other modalities are missing.
- The approach yields state-of-the-art results and greater robustness across varied missing-modality scenarios on the MM-Fi and XRF55 datasets.
- The separation of purification from alignment decouples the two barriers, allowing each step to be optimized independently.
- The resulting models simplify real-world deployment by reducing dependence on simultaneous availability of all sensor types.
Where Pith is reading between the lines
- The same sequential purify-then-align logic could be tested on other multimodal tasks that suffer from variable data quality, such as video-audio fusion or sensor fusion in robotics.
- One could examine whether replacing the diffusion step with other alignment techniques preserves the robustness gains while changing computational cost.
- The framework implies that explicitly modeling the causal link between contamination and representation gaps may be useful in designing training pipelines for intermittent sensor systems.
Load-bearing premise
The meta-learning weighting mechanism can reliably identify and down-weight low-contributing modalities, and the diffusion distillation from the resulting purified consensus will reduce representation gaps without introducing new contamination.
What would settle it
A controlled test showing that single-modality models trained under PTA achieve no accuracy gain over standard training when one modality is known to be noisy and low-contributing would indicate that the purification or distillation steps failed.
Figures
read the original abstract
Robust multimodal human sensing must overcome the critical challenge of missing modalities. Two principal barriers are the Representation Gap between heterogeneous data and the Contamination Effect from low-quality modalities. These barriers are causally linked, as the corruption introduced by contamination fundamentally impedes the reduction of representation disparities. In this paper, we propose PTA, a novel "Purify-then-Align" framework that solves this causal dependency through a synergistic integration of meta-learning and knowledge diffusion. To purify the knowledge source, PTA first employs a meta-learning-driven weighting mechanism that dynamically learns to down-weight the influence of noisy, low-contributing modalities. Subsequently, to align different modalities, PTA introduces a diffusion-based knowledge distillation paradigm in which an information-rich clean teacher, formed from this purified consensus, refines the features of each student modality. The ultimate payoff of this "Purify-then-Align" strategy is the creation of exceptionally powerful single-modality encoders imbued with cross-modal knowledge. Comprehensive experiments on the large-scale MM-Fi and XRF55 datasets, under pronounced Representation Gap and Contamination Effect, demonstrate that PTA achieves state-of-the-art performance and significantly improves the robustness of single-modality models in diverse missing-modality scenarios.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes the PTA ('Purify-then-Align') framework for robust human sensing under missing modalities. It first uses a meta-learning-driven weighting mechanism to purify the multimodal teacher by down-weighting noisy or low-contributing modalities, then employs a diffusion-based knowledge distillation from the purified clean teacher to align the representations of individual modality students. The result is enhanced single-modality encoders that incorporate cross-modal knowledge. Experiments on MM-Fi and XRF55 datasets under various missing-modality scenarios demonstrate state-of-the-art performance and improved robustness.
Significance. If the empirical results hold, this work could have significant impact on multimodal machine learning for human sensing applications by addressing the linked problems of representation gaps and contamination effects through a sequential purify-then-align strategy. The combination of meta-learning for purification and diffusion for distillation offers a fresh approach that may generalize to other noisy multimodal settings. The paper provides comprehensive experiments on large-scale datasets, which is a strength.
major comments (2)
- §3.2, meta-learning weighting mechanism: the paper claims this dynamically down-weights low-contributing modalities to purify the teacher, but provides no ablation isolating its contribution versus a simple average or attention baseline; without this, it is unclear whether the weighting is load-bearing for the robustness gains or merely incidental.
- Table 3, high-contamination rows: PTA reports SOTA accuracy, yet the table omits standard deviations across runs and any statistical significance tests against the strongest baseline; this weakens the central claim that the purify-then-align sequence reliably overcomes contamination.
minor comments (3)
- Abstract: the quantitative improvements (e.g., absolute accuracy gains under 30-70% missing rates) are not stated, forcing readers to reach the results section to assess the magnitude of the contribution.
- §4.1: the description of missing-modality simulation protocols on MM-Fi and XRF55 could specify the exact random-seed strategy and modality dropout patterns to improve reproducibility.
- Figure 2: the diffusion distillation diagram is clear but the student-teacher feature dimensions and the number of diffusion steps are not annotated, creating a minor mismatch with the equations in §3.3.
Simulated Author's Rebuttal
We thank the referee for their thorough review and positive recommendation for minor revision. We appreciate the constructive comments and address each major point below. We will revise the manuscript accordingly to strengthen the empirical validation.
read point-by-point responses
-
Referee: §3.2, meta-learning weighting mechanism: the paper claims this dynamically down-weights low-contributing modalities to purify the teacher, but provides no ablation isolating its contribution versus a simple average or attention baseline; without this, it is unclear whether the weighting is load-bearing for the robustness gains or merely incidental.
Authors: We agree that an ablation study would help isolate the contribution of the meta-learning weighting mechanism. Although the design is motivated by the need to handle contamination effects dynamically (as opposed to static averaging or attention), we will add a new ablation in the revised manuscript. Specifically, we will compare PTA with variants using uniform modality averaging and a standard cross-attention baseline for the teacher purification step, reporting results on both MM-Fi and XRF55 under missing-modality conditions. This will demonstrate that the meta-learning component is essential for the performance gains. revision: yes
-
Referee: Table 3, high-contamination rows: PTA reports SOTA accuracy, yet the table omits standard deviations across runs and any statistical significance tests against the strongest baseline; this weakens the central claim that the purify-then-align sequence reliably overcomes contamination.
Authors: We acknowledge the importance of reporting variability and statistical significance for robust claims. In the revised manuscript, we will update Table 3 to include standard deviations over multiple runs (e.g., 5 seeds) for all compared methods in the high-contamination scenarios. Additionally, we will perform and report paired t-tests or Wilcoxon tests against the strongest baseline, with p-values, to confirm the statistical significance of PTA's improvements. This will better support the reliability of the purify-then-align approach in overcoming contamination. revision: yes
Circularity Check
No significant circularity in derivation chain
full rationale
The provided abstract and description outline a PTA framework that sequences meta-learning for modality weighting followed by diffusion-based distillation, without any equations, loss functions, or derivations shown that reduce by construction to fitted parameters, self-definitions, or self-citation chains. The central claims rely on standard techniques applied in a claimed novel order rather than on internal reductions or imported uniqueness theorems. No load-bearing steps match the enumerated circularity patterns, making the derivation self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
PTA first employs a meta-learning-driven weighting mechanism that dynamically learns to down-weight the influence of noisy, low-contributing modalities... diffusion-based knowledge distillation paradigm
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Tadas Baltru ˇsaitis, Chaitanya Ahuja, and Louis-Philippe Morency. Multimodal machine learning: A survey and tax- onomy.IEEE transactions on pattern analysis and machine intelligence, 41(2):423–443, 2018. 1
work page 2018
-
[2]
Multimodal fu- sion via teacher-student network for indoor action recogni- tion
XB Bruce, Yan Liu, and Keith CC Chan. Multimodal fu- sion via teacher-student network for indoor action recogni- tion. InProceedings of the AAAI conference on artificial intelligence, pages 3199–3207, 2021. 3
work page 2021
-
[3]
Zhe Cao, Gines Hidalgo, Tomas Simon, Shih-En Wei, and Yaser Sheikh. Openpose: Realtime multi-person 2d pose estimation using part affinity fields.IEEE transactions on pattern analysis and machine intelligence, 43(1):172–186,
-
[4]
Xinyan Chen and Jianfei Yang. X-fi: A modality-invariant foundation model for multimodal human sensing.arXiv preprint arXiv:2410.10167, 2025. 3, 5, 6, 7
-
[5]
Cheng Cheng, Wenzhe Liu, Zhaoxin Fan, Lin Feng, and Ziyu Jia. A novel transformer autoencoder for multi-modal emotion recognition with incomplete data.Neural Networks, 172:106111, 2024. 2
work page 2024
-
[6]
Avigyan Das, Pritam Sil, Pawan Kumar Singh, Vikrant Bhateja, and Ram Sarkar. Mmhar-ensemnet: A multi-modal human activity recognition model.IEEE Sensors Journal, 21 (10):11569–11576, 2021. 3, 6
work page 2021
-
[7]
milliflow: Scene flow estimation on mmwave radar point cloud for human motion sensing
Fangqiang Ding, Zhen Luo, Peijun Zhao, and Chris Xiaox- uan Lu. milliflow: Scene flow estimation on mmwave radar point cloud for human motion sensing. InEuropean Con- ference on Computer Vision, pages 202–221. Springer, 2024. 2
work page 2024
-
[8]
Han Ding, Zhenbin Chen, Cui Zhao, Fei Wang, Ge Wang, Wei Xi, and Jizhong Zhao. Mi-mesh: 3d human mesh con- struction by fusing image and millimeter wave.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 7(1):1–24, 2023. 2
work page 2023
-
[9]
Revisiting skeleton-based action recognition
Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, and Bo Dai. Revisiting skeleton-based action recognition. InPro- ceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2969–2978, 2022. 2
work page 2022
-
[10]
Masked autoencoders are scalable vision learners
Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Doll´ar, and Ross Girshick. Masked autoencoders are scalable vision learners. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 16000– 16009, 2022. 2
work page 2022
-
[11]
Distilling the Knowledge in a Neural Network
Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. Distill- ing the knowledge in a neural network.arXiv preprint arXiv:1503.02531, 2015. 4
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[12]
Denoising dif- fusion probabilistic models
Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising dif- fusion probabilistic models. InAdvances in Neural Informa- tion Processing Systems, pages 6840–6851, 2020. 3, 4
work page 2020
-
[13]
Learn- ing with side information through modality hallucination
Judy Hoffman, Saurabh Gupta, and Trevor Darrell. Learn- ing with side information through modality hallucination. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 826–834, 2016. 2
work page 2016
-
[14]
Knowledge diffusion for distillation
Tao Huang, Yuan Zhang, Mingkai Zheng, Shan You, Fei Wang, Chen Qian, and Chang Xu. Knowledge diffusion for distillation. InAdvances in Neural Information Processing Systems, pages 65299–65316. Curran Associates, Inc., 2023. 4, 5
work page 2023
-
[15]
Mmact: A large-scale dataset for cross modal human action understanding
Quan Kong, Ziming Wu, Ziwei Deng, Martin Klinkigt, Bin Tong, and Tomokazu Murakami. Mmact: A large-scale dataset for cross modal human action understanding. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 8658–8667, 2019. 2, 3
work page 2019
-
[16]
Min Gu Kwak, Lingchao Mao, Zhiyang Zheng, Yi Su, Flem- ing Lure, and Jing Li. A cross-modal mutual knowledge dis- tillation framework for alzheimer’s disease diagnosis: Ad- dressing incomplete modalities.IEEE Transactions on Au- tomation Science and Engineering, 2025. 2, 3
work page 2025
-
[17]
Bo Lan, Pei Li, Jiaxi Yin, Yunpeng Song, Ge Wang, Han Ding, Jinsong Han, and Fei Wang. Xrf v2: A dataset for action summarization with wi-fi signals, and imus in phones, watches, earbuds, and glasses.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 9(3):1–41, 2025. 2
work page 2025
-
[18]
Jin Li, Shoujin Wang, Qi Zhang, Shui Yu, and Fang Chen. Generating with fairness: A modality-diffused counterfac- tual framework for incomplete multimodal recommenda- tions. InProceedings of the ACM on Web Conference 2025, pages 2787–2798, 2025. 2
work page 2025
-
[19]
Deepfusion: Lidar-camera deep fusion for multi-modal 3d object detection
Yingwei Li, Adams Wei Yu, Tianjian Meng, Ben Caine, Ji- quan Ngiam, Daiyi Peng, Junyang Shen, Yifeng Lu, Denny Zhou, Quoc V Le, et al. Deepfusion: Lidar-camera deep fusion for multi-modal 3d object detection. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 17182–17191, 2022. 1
work page 2022
-
[20]
arXiv preprint arXiv:2209.03430 (2022)
Paul Pu Liang, Amir Zadeh, and Louis-Philippe Morency. Foundations and trends in multimodal machine learning: Principles, challenges, and open questions.arXiv preprint arXiv:2209.03430, 2022. 2
-
[21]
Rui Liu, Haolin Zuo, Zheng Lian, Bj ¨orn W Schuller, and Haizhou Li. Contrastive learning based modality-invariant feature acquisition for robust multimodal emotion recogni- tion with missing modalities.IEEE Transactions on Affective Computing, 15(4):1856–1873, 2024. 2
work page 2024
-
[22]
Yanbei Liu, Lianxi Fan, Changqing Zhang, Tao Zhou, Zhi- tao Xiao, Lei Geng, and Dinggang Shen. Incomplete multi- modal representation learning for alzheimer’s disease diag- nosis.Medical Image Analysis, 69:101953, 2021. 1, 2
work page 2021
-
[23]
Yizhe Lv, Tingting Zhang, Zhijian Wang, Yunpeng Song, Han Ding, Jinsong Han, and Fei Wang. mmegohand: Egocentric hand pose estimation and gesture recognition with head-mounted millimeter-wave radar and imu.arXiv preprint arXiv:2501.13805, 2025. 2
-
[24]
Smil: Multimodal learning with severely missing modality
Mengmeng Ma, Jian Ren, Long Zhao, Sergey Tulyakov, Cathy Wu, and Xi Peng. Smil: Multimodal learning with severely missing modality. InProceedings of the AAAI Con- ference on Artificial Intelligence, pages 8827–8835, 2021. 1, 3
work page 2021
-
[25]
Fli- gan: Enhancing federated learning with incomplete data us- ing gan
Paul Joe Maliakel, Shashikant Ilager, and Ivona Brandic. Fli- gan: Enhancing federated learning with incomplete data us- ing gan. InProceedings of the 7th international workshop on edge systems, analytics and networking, pages 1–6, 2024. 2
work page 2024
-
[26]
Handling incomplete heterogeneous data us- ing vaes.Pattern Recognition, 107:107501, 2020
Alfredo Nazabal, Pablo M Olmos, Zoubin Ghahramani, and Isabel Valera. Handling incomplete heterogeneous data us- ing vaes.Pattern Recognition, 107:107501, 2020. 1
work page 2020
-
[27]
Muhammed Zahid Ozturk, Chenshu Wu, Beibei Wang, and KJ Ray Liu. Gaitcube: Deep data cube learning for human recognition with millimeter-wave radio.IEEE Internet of Things Journal, 9(1):546–557, 2021. 2
work page 2021
-
[28]
Training strate- gies to handle missing modalities for audio-visual expression recognition
Srinivas Parthasarathy and Shiva Sundaram. Training strate- gies to handle missing modalities for audio-visual expression recognition. InCompanion Publication of the 2020 Inter- national Conference on Multimodal Interaction, pages 400– 404, 2020. 2
work page 2020
-
[29]
Denois- ing diffusion implicit models
Jiaming Song, Chenlin Meng, and Stefano Ermon. Denois- ing diffusion implicit models. InInternational Conference on Learning Representations, 2021. 5
work page 2021
-
[30]
Timo von Marcard, Gerard Pons-Moll, and Bodo Rosen- hahn. Human pose estimation from video and imus.IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(8):1533–1547, 2016. 3
work page 2016
-
[31]
Con- cealed data poisoning attacks on nlp models
Eric Wallace, Tony Zhao, Shi Feng, and Sameer Singh. Con- cealed data poisoning attacks on nlp models. InProceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Lan- guage Technologies, pages 139–150, 2021. 8
work page 2021
-
[32]
Fei Wang, Jianwei Feng, Yinliang Zhao, Xiaobin Zhang, Shiyuan Zhang, and Jinsong Han. Joint activity recognition and indoor localization with wifi fingerprints.Ieee Access, 7:80058–80068, 2019. 2
work page 2019
-
[33]
Person-in-wifi: Fine-grained person percep- tion using wifi
Fei Wang, Sanping Zhou, Stanislav Panev, Jinsong Han, and Dong Huang. Person-in-wifi: Fine-grained person percep- tion using wifi. InProceedings of the IEEE/CVF inter- national conference on computer vision, pages 5452–5461,
-
[34]
Fei Wang, Yizhe Lv, Mengdie Zhu, Han Ding, and Jinsong Han. Xrf55: A radio frequency dataset for human indoor action analysis.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 8(1):1–34,
-
[35]
Fei Wang, Tingting Zhang, Wei Xi, Han Ding, Ge Wang, Di Zhang, Yuanhao Cui, Fan Liu, Jinsong Han, Jie Xu, et al. A survey on wi-fi sensing generalizability: Taxonomy, tech- niques, datasets, and future research prospects.IEEE Com- munications Surveys & Tutorials, 2026. 8
work page 2026
-
[36]
Multi-modal learning with missing modality via shared-specific feature modelling
Hu Wang, Salma Hassan, Yuyuan Liu, Congbo Ma, Yuan- hong Chen, Qing Li, Jiahui Geng, Bingjie Wang, Yu Tian, Yutong Xie, et al. Meta-learned modality-weighted knowl- edge distillation for robust multi-modal learning with miss- ing data.arXiv preprint arXiv:2405.07155, 2024. 3
-
[37]
Qianqian Wang, Huanhuan Lian, Gan Sun, Quanxue Gao, and Licheng Jiao. icmsc: Incomplete cross-modal subspace clustering.IEEE Transactions on Image Processing, 30:305– 317, 2020. 1, 2
work page 2020
-
[38]
Multimodal learning with incomplete modalities by knowl- edge distillation
Qi Wang, Liang Zhan, Paul Thompson, and Jiayu Zhou. Multimodal learning with incomplete modalities by knowl- edge distillation. InProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1828–1838, 2020. 2, 3
work page 2020
-
[39]
Yuanzhi Wang, Yong Li, and Zhen Cui. Incomplete multimodality-diffused emotion recognition.Advances in Neural Information Processing Systems, 36:17117–17128,
-
[40]
Deep Multimodal Learning with Missing Modality: A Survey
Renjie Wu, Hu Wang, Hsiang-Ting Chen, and Gustavo Carneiro. Deep multimodal learning with missing modality: A survey.arXiv preprint arXiv:2409.07825, 2024. 1
work page internal anchor Pith review arXiv 2024
-
[41]
Leveraging knowledge of modality experts for incomplete multimodal learning
Wenxin Xu, Hexin Jiang, and Xuefeng Liang. Leveraging knowledge of modality experts for incomplete multimodal learning. InProceedings of the 32nd ACM International Conference on Multimedia, pages 438–446, 2024. 1, 2
work page 2024
-
[42]
mmmesh: To- wards 3d real-time dynamic human mesh construction using millimeter-wave
Hongfei Xue, Yan Ju, Chenglin Miao, Yijiang Wang, Shiyang Wang, Aidong Zhang, and Lu Su. mmmesh: To- wards 3d real-time dynamic human mesh construction using millimeter-wave. InProceedings of the 19th Annual Inter- national Conference on Mobile Systems, Applications, and Services, pages 269–282, 2021. 2
work page 2021
-
[43]
Zihui Xue, Zhengqi Gao, Sucheng Ren, and Hang Zhao. The modality focusing hypothesis: Towards understand- ing crossmodal knowledge distillation.arXiv preprint arXiv:2206.06487, 2022. 2, 3
-
[44]
Person-in-wifi 3d: End-to-end multi- person 3d pose estimation with wi-fi
Kangwei Yan, Fei Wang, Bo Qian, Han Ding, Jinsong Han, and Xing Wei. Person-in-wifi 3d: End-to-end multi- person 3d pose estimation with wi-fi. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 969–978, 2024. 2
work page 2024
-
[45]
Yunlu Yan, Chun-Mei Feng, Yuexiang Li, Ping Li, Rick Siow Mong Goh, Baiying Lei, Weiming Wang, David Da- gan Feng, and Lei Zhu. Federated pseudo modality gener- ation for incomplete multi-modal mri reconstruction.IEEE Journal of Biomedical and Health Informatics, 2025. 2
work page 2025
-
[46]
Jianfei Yang, He Huang, Yunjiao Zhou, Xinyan Chen, Yue- cong Xu, Shenghai Yuan, Han Zou, Chris Xiaoxuan Lu, and Lihua Xie. Mm-fi: Multi-modal non-intrusive 4d human dataset for versatile wireless sensing.Advances in Neural Information Processing Systems, 36:18756–18768, 2023. 2, 5, 6, 7
work page 2023
-
[47]
Yanwu Yang, Hairui Chen, Zhikai Chang, Yang Xiang, Chenfei Ye, and Ting Ma. Incomplete learning of multi- modal connectome for brain disorder diagnosis via modal- mixup and deep supervision. InMedical Imaging With Deep Learning, pages 1006–1018. PMLR, 2024. 2
work page 2024
-
[48]
Ronghui Zhang, Chunxiao Jiang, Sheng Wu, Quan Zhou, Xi- aojun Jing, and Junsheng Mu. Wi-fi sensing for joint gesture recognition and human identification from few samples in human-computer interaction.IEEE Journal on Selected Ar- eas in Communications, 40(7):2193–2205, 2022. 2
work page 2022
-
[49]
Through-wall human pose estimation using radio signals
Mingmin Zhao, Tianhong Li, Mohammad Abu Alsheikh, Yonglong Tian, Hang Zhao, Antonio Torralba, and Dina Katabi. Through-wall human pose estimation using radio signals. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 7356–7365, 2018. 2
work page 2018
-
[50]
Through-wall human mesh recovery using radio signals
Mingmin Zhao, Yingcheng Liu, Aniruddh Raghu, Tian- hong Li, Hang Zhao, Antonio Torralba, and Dina Katabi. Through-wall human mesh recovery using radio signals. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 10113–10122, 2019. 2
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.