A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning

Marco Fraccaro; Ole Winther; Simon Kamronn; Ulrich Paquet

arxiv: 1710.05741 · v2 · pith:C3IYCVGUnew · submitted 2017-10-16 · 📊 stat.ML · cs.LG

A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning

Marco Fraccaro , Simon Kamronn , Ulrich Paquet , Ole Winther This is my paper

classification 📊 stat.ML cs.LG

keywords datadynamicslatentmodelframeslearningmissingrecognition

0 comments

read the original abstract

This paper takes a step towards temporal reasoning in a dynamically changing video, not in the pixel space that constitutes its frames, but in a latent space that describes the non-linear dynamics of the objects in its world. We introduce the Kalman variational auto-encoder, a framework for unsupervised learning of sequential data that disentangles two latent representations: an object's representation, coming from a recognition model, and a latent state describing its dynamics. As a result, the evolution of the world can be imagined and missing data imputed, both without the need to generate high dimensional frames at each time step. The model is trained end-to-end on videos of a variety of simulated physical systems, and outperforms competing methods in generative and missing data imputation tasks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Hybrid Adaptive Kalman Filtering for Data-Efficient Joint Tracking and Classification
cs.RO 2026-06 unverdicted novelty 6.0

Self-supervised hybrid adaptive Kalman filter learns structured corrections for data-efficient joint tracking and classification.