arxiv: 1609.09869 · v2 · pith:F7BAUWTK · submitted 2016-09-30 · stat.ML · cs.AI · cs.LG

Structured Inference Networks for Nonlinear State Space Models

classification: stat.ML · cs.AI · cs.LG
keywords: models, algorithm, networks, space, state, structured, approximation, generative

Abstract

Gaussian state space models have been used for decades as generative models of sequential data. They admit an intuitive probabilistic interpretation, have a simple functional form, and enjoy widespread adoption. We introduce a unified algorithm to efficiently learn a broad class of linear and non-linear state space models, including variants where the emission and transition distributions are modeled by deep neural networks. Our learning algorithm simultaneously learns a compiled inference network and the generative model, leveraging a structured variational approximation parameterized by recurrent neural networks to mimic the posterior distribution. We apply the learning algorithm to both synthetic and real-world datasets, demonstrating its scalability and versatility. We find that using the structured approximation to the posterior results in models with significantly higher held-out likelihood.
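To make the model class concrete, here is a minimal sketch of ancestral sampling from a one-dimensional nonlinear Gaussian state space model. It illustrates only the generative side (a transition distribution and an emission distribution), not the paper's learning algorithm or inference network. The function name `sample_ssm`, the standard-normal initial state, the fixed noise scales, and the particular `transition` and `emission` functions are all assumptions chosen for illustration; in the setting the abstract describes, those two functions could be deep neural networks.

```python
import math
import random


def sample_ssm(T, transition, emission, trans_std=0.1, emit_std=0.1, seed=0):
    """Draw one length-T sequence from a scalar Gaussian state space model.

    Assumed form (illustrative, not taken from the paper):
        z_1 ~ N(0, 1)
        z_t ~ N(transition(z_{t-1}), trans_std^2)
        x_t ~ N(emission(z_t), emit_std^2)
    """
    rng = random.Random(seed)
    z = rng.gauss(0.0, 1.0)  # initial latent state from the assumed prior
    latents, observations = [], []
    for _ in range(T):
        latents.append(z)
        # Emit an observation centred on a function of the current state.
        observations.append(rng.gauss(emission(z), emit_std))
        # Move the latent state forward with Gaussian transition noise.
        z = rng.gauss(transition(z), trans_std)
    return latents, observations


# Hypothetical nonlinear transition and emission functions.
def transition(z):
    return math.tanh(1.5 * z)


def emission(z):
    return 0.5 * z * z


latents, observations = sample_ssm(T=25, transition=transition, emission=emission)
```

Replacing `transition` and `emission` with linear maps gives the classical linear Gaussian case, for which exact posterior inference is available; with nonlinear functions the posterior over the latent states generally has no closed form, which is the situation the abstract's structured variational approximation is meant to address.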

This paper has not been read by Pith yet.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Robust Filter Attention: Self-Attention as Precision-Weighted State Estimation

    cs.LG · 2025-09 · unverdicted · novelty 7.0

    Robust Filter Attention models self-attention as consistency-based state estimation under a linear SDE for token trajectories, matching standard attention complexity while showing lower perplexity and better zero-shot...