Stochastic Prediction of Multi-Agent Interactions from Partial Observations

Chen Sun, Per Karlsson, Jiajun Wu, Joshua B Tenenbaum, Kevin Murphy

classification cs.LG cs.CV stat.ML

keywords method, information, learned, model, agents, ambiguous, baselines, basketball

We present a method that learns to integrate temporal information, from a learned dynamics model, with ambiguous visual information, from a learned vision model, in the context of interacting agents. Our method is based on a graph-structured variational recurrent neural network (Graph-VRNN), which is trained end-to-end to infer the current state of the (partially observed) world, as well as to forecast future states. We show that our method outperforms various baselines on two sports datasets, one based on real basketball trajectories, and one generated by a soccer game engine.
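
For orientation, the abstract names a variational recurrent neural network (VRNN) as the backbone. A standard VRNN is typically trained by maximizing a per-timestep evidence lower bound, sketched below. This is the generic VRNN objective, not necessarily the exact loss used for Graph-VRNN. The symbols are assumptions introduced here for illustration: x_t is the observation at time t, z_t the latent state, q the inference network, and p the generative model with a learned prior.

```latex
\mathcal{L} = \sum_{t=1}^{T}
  \mathbb{E}_{q(z_{\le t} \mid x_{\le t})} \Big[
    \log p(x_t \mid z_{\le t}, x_{<t})
    - \mathrm{KL}\big( q(z_t \mid x_{\le t}, z_{<t}) \,\|\, p(z_t \mid x_{<t}, z_{<t}) \big)
  \Big]
```

In a graph-structured variant one would presumably keep one such recurrent latent model per agent and let the per-agent hidden states exchange information along graph edges at each step; that reading is an assumption based on the abstract alone.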

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
cs.CV 2023-10 unverdicted novelty 7.0

Latent Consistency Models enable high-fidelity text-to-image generation in 2-4 steps by directly predicting solutions to the probability flow ODE in latent space, distilled from pre-trained LDMs.

Heteroscedastic Diffusion for Multi-Agent Trajectory Modeling
cs.LG 2026-05 unverdicted novelty 6.0

U2Diffine augments diffusion denoising with negative log-likelihood loss and first-order uncertainty propagation to jointly perform trajectory completion and provide per-state heteroscedastic uncertainty for multi-age...