Diverse Yet Consistent: Context-Guided Diffusion with Energy-Based Joint Refinement for Multi-Agent Motion Prediction
Pith reviewed 2026-05-22 07:11 UTC · model grok-4.3
The pith
A context-guided diffusion process generates diverse multi-agent motions that energy-based refinement then adjusts to enforce interaction consistency.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By embedding contextual information from past trajectories into the diffusion sampling steps, the model produces a diverse set of candidate trajectories; an energy-based term is then used to reshape the joint distribution so that the selected trajectories satisfy interaction consistency while leaving the individual trajectory plausibilities largely intact.
What carries the argument
Context-guided diffusion sampling followed by energy-based refinement of the joint trajectory distribution.
If this is right
- Marginal prediction errors (ADE/FDE) on ETH/UCY drop below those of strong single-agent baselines.
- Joint errors (JADE/JFDE) also improve over marginal baselines while remaining competitive with earlier joint methods.
- The same framework delivers larger marginal gains than prior joint-prediction techniques without sacrificing joint performance.
Where Pith is reading between the lines
- The same two-stage pattern could be tested on vehicle or drone trajectory data where interaction rules differ from pedestrian crowds.
- Replacing the energy term with a learned critic might allow the refinement stage to adapt to new scene types without hand-designed potentials.
- Extending the diffusion guidance to include map or semantic context could further increase diversity while the energy stage still enforces physical constraints.
Load-bearing premise
The energy-based step can adjust the joint distribution for interaction consistency without making the individual trajectories less plausible than those produced by the diffusion stage alone.
What would settle it
On the ETH or UCY datasets, run the method and check whether joint metrics such as JADE or JFDE improve while marginal ADE/FDE stay at least as good as the diffusion-only version; if either the joint scores fail to rise or the individual paths become visibly unrealistic, the central claim would be undercut.
Figures
read the original abstract
Deepgenerative models havebecomeapromisingapproach for human motion prediction due to their ability to capture multimodal distributions and represent diverse human be haviors. However, generating predictions that are both di verse and jointly consistent among interacting agents re mains challenging. In addition, most existing approaches are primarily evaluated using single-agent (marginal) met rics, which fail to fully reflect the joint dynamics of multi agent interactions. We propose a diffusion-based frame work that improves multi-agent motion prediction by lever aging rich contextual information from historical trajecto ries. This information is incorporated through a guidance mechanism to enhance the diversity and expressiveness of predicted motions. To further enforce interaction consis tency, we introduce an energy-based formulation that re fines the joint trajectory distribution while preserving the plausibility of individual trajectories. Extensive experi ments on four benchmark datasets demonstrate that our approach consistently outperforms existing methods. No tably, our approach substantially improves both marginal (ADE/FDE) and joint (JADE/JFDE) metrics on ETH/UCY over strong marginal baselines. Compared with prior joint prediction methods, it delivers significant gains in marginal metrics while maintaining competitive joint performance.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a diffusion-based framework for multi-agent motion prediction that leverages contextual information from historical trajectories via a guidance mechanism to enhance diversity and expressiveness. It introduces an energy-based formulation for joint trajectory refinement to enforce interaction consistency while preserving individual trajectory plausibility. The approach is evaluated on four benchmark datasets, showing consistent outperformance, with substantial improvements in marginal (ADE/FDE) and joint (JADE/JFDE) metrics on ETH/UCY over strong baselines, and gains in marginal metrics compared to prior joint methods.
Significance. If the results hold, this work would be significant for advancing multi-agent motion prediction by addressing the trade-off between diversity and joint consistency, which is key for applications in autonomous driving and human-robot interaction. The combination of context-guided diffusion and energy-based refinement provides a novel way to generate diverse yet consistent predictions, potentially outperforming methods focused solely on marginal or joint metrics. The reported improvements on standard benchmarks suggest practical relevance.
major comments (2)
- [Abstract and §4 Experiments] Abstract and §4 Experiments: The central claim that the energy-based joint refinement enforces interaction consistency without harming the plausibility or diversity of individual trajectories is load-bearing for the title and reported gains over marginal baselines. However, the manuscript provides no quantitative before-vs-after comparison of marginal ADE/FDE (or diversity measures such as mode coverage) after the refinement step. This leaves open the possibility that the energy term over-penalizes valid but less common interaction modes, directly undermining the 'diverse yet consistent' claim.
- [§3.3 Energy-based joint refinement] §3.3 Energy-based joint refinement: The exact mathematical form of the energy function, its weighting relative to the diffusion score, and the sampling procedure used for refinement are not specified with sufficient precision to evaluate whether the refinement step can be guaranteed to preserve marginal plausibility. Without this, it is impossible to assess the risk that the joint consistency term collapses modes or pushes trajectories into low-probability regions.
minor comments (2)
- [Abstract] The abstract contains multiple typographical and spacing errors (e.g., 'Deepgenerative models havebecomeapromisingapproach', 'be haviors'). These should be corrected.
- [Throughout] Ensure first use of acronyms (ADE, FDE, JADE, JFDE) is accompanied by their full definitions.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive feedback on our manuscript. We address each of the major comments below and outline the revisions we plan to make to strengthen the paper.
read point-by-point responses
-
Referee: [Abstract and §4 Experiments] Abstract and §4 Experiments: The central claim that the energy-based joint refinement enforces interaction consistency without harming the plausibility or diversity of individual trajectories is load-bearing for the title and reported gains over marginal baselines. However, the manuscript provides no quantitative before-vs-after comparison of marginal ADE/FDE (or diversity measures such as mode coverage) after the refinement step. This leaves open the possibility that the energy term over-penalizes valid but less common interaction modes, directly undermining the 'diverse yet consistent' claim.
Authors: We acknowledge that a quantitative before-and-after analysis would provide stronger evidence for our claim. To address this, we will add a new ablation study in Section 4 of the revised manuscript. This study will report marginal ADE/FDE and diversity metrics (such as the number of modes covered) computed on the trajectories before and after the energy-based joint refinement. We expect this to show that the refinement step improves joint metrics while maintaining or even enhancing marginal performance and diversity, consistent with our overall results on the benchmarks. revision: yes
-
Referee: [§3.3 Energy-based joint refinement] §3.3 Energy-based joint refinement: The exact mathematical form of the energy function, its weighting relative to the diffusion score, and the sampling procedure used for refinement are not specified with sufficient precision to evaluate whether the refinement step can be guaranteed to preserve marginal plausibility. Without this, it is impossible to assess the risk that the joint consistency term collapses modes or pushes trajectories into low-probability regions.
Authors: We agree that the description in §3.3 could be more precise to allow full evaluation and reproducibility. In the revised manuscript, we will expand §3.3 to include the exact mathematical formulation of the energy function E(·), the specific weighting parameter λ used to balance it with the diffusion score, and the details of the sampling procedure (e.g., the number of refinement steps and the optimization method employed). This will enable readers to assess the preservation of marginal plausibility and the risk of mode collapse. revision: yes
Circularity Check
No circularity: method builds on standard diffusion and energy-based models with empirical validation
full rationale
The paper presents a context-guided diffusion process for diverse multi-agent motion predictions followed by an energy-based joint refinement step to enforce interaction consistency. No equations, derivations, or self-citations are visible in the provided abstract or description that reduce any claimed result to its own inputs by construction. The approach is described as leveraging established diffusion guidance mechanisms and energy-based formulations without self-referential fitting, parameter renaming, or load-bearing uniqueness theorems from the authors' prior work. Performance claims rest on experimental comparisons against baselines on ETH/UCY and other benchmarks rather than tautological predictions, rendering the derivation self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
invented entities (1)
-
Energy-based joint refinement
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
we introduce an energy-based formulation that refines the joint trajectory distribution while preserving the plausibility of individual trajectories... Pjoint(Y|G) ∝ P Diff(Y|G) exp(−Eθ(Y,G))
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Joint Distribution Refinement (JDR), which shifts probability mass toward jointly consistent trajectories
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Tripod: Human trajectory and pose dynamics forecasting in the wild
Vida Adeli, Mahsa Ehsanpour, Ian Reid, Juan Car- los Niebles, Silvio Savarese, Ehsan Adeli, and Hamid Rezatofighi. Tripod: Human trajectory and pose dynamics forecasting in the wild. InICCV, pages 13390–13400, 2021. 2, 3
work page 2021
-
[2]
So- cial lstm: Human trajectory prediction in crowded spaces
Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, and Silvio Savarese. So- cial lstm: Human trajectory prediction in crowded spaces. In CVPR, pages 961–971, 2016. 1, 2
work page 2016
-
[3]
Non- probability sampling network for stochastic human trajectory prediction
Inhwan Bae, Jin-Hwi Park, and Hae-Gon Jeon. Non- probability sampling network for stochastic human trajectory prediction. InCVPR, pages 6477–6487, 2022. 6
work page 2022
-
[4]
Eigentrajectory: Low-rank descriptors for multi-modal trajectory forecasting
Inhwan Bae, Jean Oh, and Hae-Gon Jeon. Eigentrajectory: Low-rank descriptors for multi-modal trajectory forecasting. InICCV, pages 10017–10029, 2023. 6
work page 2023
-
[5]
Singu- lartrajectory: Universal trajectory predictor using diffusion model
Inhwan Bae, Young-Jae Park, and Hae-Gon Jeon. Singu- lartrajectory: Universal trajectory predictor using diffusion model. InCVPR, pages 17890–17901, 2024. 1, 5, 6
work page 2024
-
[6]
Pointwise: Predicting points and valuing deci- sions in real time with nba optical tracking data
Dan Cervone, Alexander D’amour, Luke Bornn, and Kirk Goldsberry. Pointwise: Predicting points and valuing deci- sions in real time with nba optical tracking data. InPro- ceedings of the 8th MIT Sloan Sports Analytics Conference, Boston, MA, USA, 2014. 6
work page 2014
-
[7]
Context-conditioned spatio-temporal predictive learning for reliable v2v channel prediction.TITS,
Lei Chu, Daoud Burghal, Rui Wang, Michael Neuman, and Andreas F Molisch. Context-conditioned spatio-temporal predictive learning for reliable v2v channel prediction.TITS,
-
[8]
Lookout: Diverse multi-future predic- tion and planning for self-driving
Alexander Cui, Sergio Casas, Abolfazl Sadat, Renjie Liao, and Raquel Urtasun. Lookout: Diverse multi-future predic- tion and planning for self-driving. InCVPR, pages 16107– 16116, 2021. 1
work page 2021
-
[9]
Nachiket Deo and Mohan M. Trivedi. Convolutional social pooling for vehicle trajectory prediction. InCVPR, pages 1468–1476, 2018. 1
work page 2018
-
[10]
Diffusion models beat gans on image synthesis
Prafulla Dhariwal and Alexander Nichol. Diffusion models beat gans on image synthesis. InNeurIPS, pages 8780–8794,
-
[11]
Implicit generation and mod- eling with energy based models.NeurIPS, 32, 2019
Yilun Du and Igor Mordatch. Implicit generation and mod- eling with energy based models.NeurIPS, 32, 2019. 2, 5
work page 2019
-
[12]
Neuralized markov random field for interaction-aware stochastic human trajectory prediction
Zilin Fang, David Hsu, Gim Hee Lee, and Gim Hee Lee. Neuralized markov random field for interaction-aware stochastic human trajectory prediction. InICLR, 2025. 2, 3, 6, 7
work page 2025
-
[13]
Yuxiang Fu, Qi Yan, Lele Wang, Ke Li, and Renjie Liao. Moflow: One-step flow matching for human trajectory fore- casting via implicit maximum likelihood estimation based distillation. InCVPR, pages 17282–17293, 2025. 1, 5, 6, 7
work page 2025
-
[14]
Transformer networks for trajectory forecasting
Francesco Giuliari, Irtiza Hasan, Marco Cristani, and Fabio Galasso. Transformer networks for trajectory forecasting. In ICPR, pages 10335–10342. IEEE, 2021. 2
work page 2021
-
[15]
Social gan: Socially acceptable trajec- tories with generative adversarial networks
Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, and Alexandre Alahi. Social gan: Socially acceptable trajec- tories with generative adversarial networks. InCVPR, pages 2255–2264, 2018. 6, 7
work page 2018
-
[16]
Classifier-free diffusion guidance.ICLR, 2022
Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance.ICLR, 2022. 2, 4
work page 2022
-
[17]
Denoising dif- fusion probabilistic models.NeurIPS, 33:6840–6851, 2020
Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising dif- fusion probabilistic models.NeurIPS, 33:6840–6851, 2020. 2
work page 2020
-
[18]
Action-reaction: Fore- casting the dynamics of human interaction
De-An Huang and Kris M Kitani. Action-reaction: Fore- casting the dynamics of human interaction. InECCV, pages 489–504. Springer, 2014. 2, 3
work page 2014
-
[19]
Stgat: Modeling spatial-temporal interactions for human trajectory prediction
Yingfan Huang, Huikun Bi, Zhaoxin Li, Tianlu Mao, and Zhaoqi Wang. Stgat: Modeling spatial-temporal interactions for human trajectory prediction. InCVPR, pages 6272–6281,
-
[20]
An edit friendly ddpm noise space: Inversion and manipulations
Inbar Huberman-Spiegelglas, Vladimir Kulikov, and Tomer Michaeli. An edit friendly ddpm noise space: Inversion and manipulations. InCVPR, pages 12469–12478, 2024. 2
work page 2024
-
[21]
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner, Yilun Du, Joshua B Tenenbaum, and Sergey Levine. Planning with diffusion for flexible behavior synthe- sis.arXiv preprint arXiv:2205.09991, 2022. 2
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[22]
Multi-agent long-term 3d human pose forecasting via interaction-aware trajectory conditioning
Jaewoo Jeong, Daehee Park, and Kuk-Jin Yoon. Multi-agent long-term 3d human pose forecasting via interaction-aware trajectory conditioning. InCVPR, pages 1617–1628, 2024. 1
work page 2024
-
[23]
Multi-modal knowledge distillation-based human trajectory forecasting
Jaewoo Jeong, Seohee Lee, Daehee Park, Giwon Lee, and Kuk-Jin Yoon. Multi-modal knowledge distillation-based human trajectory forecasting. InCVPR, pages 24222–24233,
-
[24]
Motiondiffuser: Controllable multi-agent motion prediction using diffusion
Chiyu Jiang, Andre Cornman, Cheolho Park, Benjamin Sapp, Yin Zhou, Dragomir Anguelov, et al. Motiondiffuser: Controllable multi-agent motion prediction using diffusion. InCVPR, pages 9644–9653, 2023. 1
work page 2023
-
[25]
Human trajectory forecasting in crowds: A deep learning perspec- tive.TITS, 23(7):7386–7400, 2021
Parth Kothari, Sven Kreiss, and Alexandre Alahi. Human trajectory forecasting in crowds: A deep learning perspec- tive.TITS, 23(7):7386–7400, 2021. 1
work page 2021
-
[26]
Bcdiff: Bidirectional consistent diffusion for instantaneous trajectory prediction
Rongqing Li, Changsheng Li, Dongchun Ren, Guangyi Chen, Ye Yuan, and Guoren Wang. Bcdiff: Bidirectional consistent diffusion for instantaneous trajectory prediction. NeurIPS, 36:14400–14413, 2023. 1
work page 2023
-
[27]
Joint pedestrian tra- jectory prediction through posterior sampling
Haotian Lin, Yixiao Wang, Mingxiao Huo, Chensheng Peng, Zhiyuan Liu, and Masayoshi Tomizuka. Joint pedestrian tra- jectory prediction through posterior sampling. InProceed- ings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5672–5679, 2024. 6, 7
work page 2024
-
[28]
Progressive pretext task learning for human trajec- tory prediction
Xiaotong Lin, Tianming Liang, Jianhuang Lai, and Jian- Fang Hu. Progressive pretext task learning for human trajec- tory prediction. InECCV, pages 197–214. Springer, 2024. 2, 5
work page 2024
-
[29]
A structured self-attentive sentence embedding
Zhouhan Lin, Minwei Feng, Cicero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, and Yoshua Bengio. A structured self-attentive sentence embedding. InICLR, 2017. 3
work page 2017
-
[30]
Dongrui Liu, Chuanchaun Chen, Changqing Xu, Robert C Qiu, and Lei Chu. Self-supervised point cloud registration with deep versatile descriptors for intelligent driving.TITS, 24(9):9767–9779, 2023. 1
work page 2023
-
[31]
It is not the journey but the destination: Endpoint conditioned trajectory prediction
Karttikeya Mangalam, Harshayu Girase, Shreyas Agarwal, Kuan-Hui Lee, Ehsan Adeli, Jitendra Malik, and Adrien Gaidon. It is not the journey but the destination: Endpoint conditioned trajectory prediction. InECCV, pages 759–776,
-
[32]
From goals, waypoints & paths to long term human trajectory forecasting
Karttikeya Mangalam, Yang An, Harshayu Girase, and Ji- tendra Malik. From goals, waypoints & paths to long term human trajectory forecasting. InCVPR, pages 15233–15242,
-
[33]
Leapfrog diffusion model for stochastic trajec- tory prediction
Weibo Mao, Chenxin Xu, Qi Zhu, Siheng Chen, and Yan- feng Wang. Leapfrog diffusion model for stochastic trajec- tory prediction. InCVPR, pages 5517–5526, 2023. 1, 6
work page 2023
-
[34]
Forecasting hu- man trajectory from scene history
Mancheng Meng, Ziyan Wu, Terrence Chen, Xiran Cai, Xi- ang Zhou, Fan Yang, and Dinggang Shen. Forecasting hu- man trajectory from scene history. InNeurIPS, pages 24920– 24933, 2022. 1, 2
work page 2022
-
[35]
Abduallah Mohamed, Kun Qian, Mohamed Elhoseiny, and Christian Claudel. Social-stgcnn: A social spatio-temporal graph convolutional neural network for human trajectory prediction. InCVPR, pages 14424–14432, 2020. 1, 2, 6
work page 2020
-
[36]
Tra- jectory prediction with latent belief energy-based model
Bo Pang, Tianyang Zhao, Xu Xie, and Ying Nian Wu. Tra- jectory prediction with latent belief energy-based model. In CVPR, pages 11814–11824, 2021. 2, 5
work page 2021
-
[37]
Stirnet: A spatial-temporal interaction-aware re- cursive network for human trajectory prediction
Yusheng Peng, Gaofeng Zhang, Xiangyu Li, and Liping Zheng. Stirnet: A spatial-temporal interaction-aware re- cursive network for human trajectory prediction. InCVPR, pages 2285–2293, 2021. 2
work page 2021
-
[38]
Shaojie Qiao, Dayong Shen, Xiaoteng Wang, Nan Han, and William Zhu. A self-adaptive parameter selection trajectory prediction approach via hidden markov models.TITS, 16(1): 284–296, 2014. 2
work page 2014
-
[39]
Efficient learning of sparse representations with an energy-based model.NeurIPS, 19, 2006
Marc’Aurelio Ranzato, Christopher Poultney, Sumit Chopra, and Yann Cun. Efficient learning of sparse representations with an energy-based model.NeurIPS, 19, 2006. 2, 5
work page 2006
-
[40]
Trace and pace: Controllable pedestrian animation via guided tra- jectory diffusion
Davis Rempe, Zhengyi Luo, Xue Bin Peng, Ye Yuan, Kris Kitani, Karsten Kreis, Sanja Fidler, and Or Litany. Trace and pace: Controllable pedestrian animation via guided tra- jectory diffusion. InCVPR, pages 13756–13766, 2023. 2, 4
work page 2023
-
[41]
Learning social etiquette: Human tra- jectory understanding in crowded scenes
Alexandre Robicquet, Amir Sadeghian, Alexandre Alahi, and Silvio Savarese. Learning social etiquette: Human tra- jectory understanding in crowded scenes. InECCV, pages 549–565. Springer, 2016. 6
work page 2016
-
[42]
Joint long-term prediction of human motion using a planning- based social force approach
Andrey Rudenko, Luigi Palmieri, and Kai O Arras. Joint long-term prediction of human motion using a planning- based social force approach. InICLR, pages 4571–4577. IEEE, 2018. 2
work page 2018
-
[43]
Social-transmotion: Promptable human trajectory prediction
Saeed Saadatnejad, Yang Gao, Kaouther Messaoud, and Alexandre Alahi. Social-transmotion: Promptable human trajectory prediction. InICLR, 2023. 6
work page 2023
-
[44]
Saeed Saadatnejad, Yang Gao, Hamid Rezatofighi, and Alexandre Alahi. Jrdb-traj: A dataset and benchmark for trajectory forecasting in crowds.arXiv preprint arXiv:2311.02736, 2023. 6
-
[45]
Progressive Distillation for Fast Sampling of Diffusion Models
Tim Salimans and Jonathan Ho. Progressive distillation for fast sampling of diffusion models.arXiv preprint arXiv:2202.00512, 2022. 2
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[46]
Trajectron++: Dynamically-feasible trajec- tory forecasting with heterogeneous data
Tim Salzmann, Boris Ivanovic, Punarjay Chakravarty, and Marco Pavone. Trajectron++: Dynamically-feasible trajec- tory forecasting with heterogeneous data. InECCV, pages 683–700. Springer, 2020. 1, 6, 7
work page 2020
-
[47]
Pose forecasting in industrial human-robot collaboration
Alessio Sampieri, Guido Maria D’Amely di Melendugno, Andrea Avogaro, Federico Cunico, Francesco Setti, Geri Sk- enderi, Marco Cristani, and Fabio Galasso. Pose forecasting in industrial human-robot collaboration. InECCV, pages 51–
-
[48]
Tra- jectory unified transformer for pedestrian trajectory predic- tion
Liushuai Shi, Le Wang, Sanping Zhou, and Gang Hua. Tra- jectory unified transformer for pedestrian trajectory predic- tion. InICCV, pages 9675–9684, 2023. 2, 6
work page 2023
-
[49]
Pip: Planning- informed trajectory prediction for autonomous driving
Haoran Song, Wenchao Ding, Yuxuan Chen, Shaojie Shen, Michael Yu Wang, and Qifeng Chen. Pip: Planning- informed trajectory prediction for autonomous driving. In ECCV, pages 598–614. Springer, 2020. 1
work page 2020
-
[50]
Social diffusion: Long-term multiple hu- man motion anticipation
Julian Tanke, Linguang Zhang, Amy Zhao, Chengcheng Tang, Yujun Cai, Lezi Wang, Po-Chen Wu, Juergen Gall, and Cem Keskin. Social diffusion: Long-term multiple hu- man motion anticipation. InICCV, pages 9601–9611, 2023. 1
work page 2023
-
[51]
Ana- lyzing the variety loss in the context of probabilistic trajec- tory prediction
Luca Anthony Thiede and Pratik Prabhanjan Brahma. Ana- lyzing the variety loss in the context of probabilistic trajec- tory prediction. InICCV, pages 9954–9963, 2019. 1
work page 2019
-
[52]
An uncertain future: Forecasting from static images using variational autoencoders
Jacob Walker, Carl Doersch, Abhinav Gupta, and Martial Hebert. An uncertain future: Forecasting from static images using variational autoencoders. InECCV, pages 835–851. Springer, 2016. 1
work page 2016
-
[53]
Graphtcn: Spatio-temporal interaction modeling for human trajectory prediction
Chengxin Wang, Shaofeng Cai, and Gary Tan. Graphtcn: Spatio-temporal interaction modeling for human trajectory prediction. InWACV, pages 3450–3459, 2021. 2, 3
work page 2021
-
[54]
Joint metrics matter: A better standard for trajectory forecasting
Erica Weng, Hana Hoshino, Deva Ramanan, and Kris Ki- tani. Joint metrics matter: A better standard for trajectory forecasting. InICCV, pages 20315–20326, 2023. 2, 6, 7
work page 2023
-
[55]
View vertically: A hierarchical network for trajectory prediction via fourier spectrums
Conghao Wong, Beihao Xia, Ziming Hong, Qinmu Peng, Wei Yuan, Qiong Cao, Yibo Yang, and Xinge You. View vertically: A hierarchical network for trajectory prediction via fourier spectrums. InECCV, pages 682–700, 2020. 1, 6, 7
work page 2020
-
[56]
Songpengcheng Xia, Lei Chu, Ling Pei, Jiarui Yang, Wenx- ian Yu, and Robert C Qiu. Timestamp-supervised wearable- based activity segmentation and recognition with contrastive learning and order-preserving optimal transport.TMC, 23 (12):10734–10751, 2024. 1
work page 2024
-
[57]
Songpengcheng Xia, Yu Zhang, Zhuo Su, Xiaozheng Zheng, Zheng Lv, Guidong Wang, Yongjie Zhang, Qi Wu, Lei Chu, and Ling Pei. Envposer: Environment-aware realistic human motion estimation from sparse observations with uncertainty modeling. InProceedings of the Computer Vision and Pat- tern Recognition Conference, pages 1839–1849, 2025. 1
work page 2025
-
[58]
Chenxin Xu, Maosen Li, Zhenyang Ni, Ya Zhang, and Si- heng Chen. Groupnet: Multiscale hypergraph neural net- works for trajectory prediction with relational reasoning. In CVPR, pages 6498–6507, 2022. 6
work page 2022
-
[59]
Remember intentions: Retrospective-memory-based trajec- tory prediction
Chenxin Xu, Weibo Mao, Wenjun Zhang, and Siheng Chen. Remember intentions: Retrospective-memory-based trajec- tory prediction. InCVPR, pages 6488–6497, 2022. 6, 7
work page 2022
-
[60]
Eq- motion: Equivariant multi-agent motion prediction with in- variant interaction reasoning
Chenxin Xu, Robby T Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, and Yanfeng Wang. Eq- motion: Equivariant multi-agent motion prediction with in- variant interaction reasoning. InCVPR, pages 1410–1420,
-
[61]
So- cialvae: Human trajectory prediction using timewise latents
Pei Xu, Jean-Bernard Hayet, and Ioannis Karamouzas. So- cialvae: Human trajectory prediction using timewise latents. InECCV, pages 511–528. Springer, 2022. 1, 6
work page 2022
-
[62]
Hao Xue, Flora Salim, Yongli Ren, and Nuria Oliver. Mobt- cast: Leveraging auxiliary trajectory forecasting for human mobility prediction.NeurIPS, 34:30380–30391, 2021. 3
work page 2021
-
[63]
Dlow: Diversifying latent flows for diverse human motion prediction
Ye Yuan and Kris Kitani. Dlow: Diversifying latent flows for diverse human motion prediction. InEuropean Conference on Computer Vision, pages 346–364. Springer, 2020. 2
work page 2020
-
[64]
Agentformer: Agent-aware transformers for socio-temporal multi-agent forecasting
Ye Yuan, Xinshuo Weng, Yanglan Ou, and Kris M Kitani. Agentformer: Agent-aware transformers for socio-temporal multi-agent forecasting. InICCV, pages 9813–9823, 2021. 3, 6
work page 2021
-
[65]
Physdiff: Physics-guided human motion diffusion model
Ye Yuan, Jiaming Song, Umar Iqbal, Arash Vahdat, and Jan Kautz. Physdiff: Physics-guided human motion diffusion model. InICCV, pages 16010–16021, 2023. 2
work page 2023
-
[66]
Yu Zhang, Songpengcheng Xia, Lei Chu, Jiarui Yang, Qi Wu, and Ling Pei. Dynamic inertial poser (dynaip): Part- based motion dynamics learning for enhanced human pose estimation with sparse inertial sensors. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1889–1899, 2024. 1
work page 2024
-
[67]
Multi-agent tensor fusion for contextual trajectory predic- tion
Tianyang Zhao, Yifei Xu, Mathew Monfort, Wongun Choi, Chris Baker, Yibiao Zhao, Yizhou Wang, and Ying Nian Wu. Multi-agent tensor fusion for contextual trajectory predic- tion. InCVPR, pages 12126–12134, 2019. 1
work page 2019
-
[68]
Query-centric trajectory prediction
Zikang Zhou, Jianping Wang, Yung-Hui Li, and Yu-Kai Huang. Query-centric trajectory prediction. InCVPR, pages 17863–17873, 2023. 2
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.