Deep Variational Sequential Monte Carlo for High-Dimensional Observations

Nir Shlezinger; Ruud J.G. van Sloun; Wessel L. van Nierop

arxiv: 2501.05982 · v1 · pith:KXWYJJJOnew · submitted 2025-01-10 · 💻 cs.LG · eess.SP

Pith reviewed 2026-05-23 05:24 UTC · model grok-4.3

keywords: sequential Monte Carlo · particle filtering · variational inference · neural networks · Lorenz attractor · state estimation · high-dimensional observations

The pith

Neural networks parameterize proposal and transition distributions in a differentiable particle filter to improve tracking of high-dimensional nonlinear systems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a differentiable sequential Monte Carlo method that uses neural networks to learn the proposal and state-transition distributions from high-dimensional observations alone. Training relies on an unsupervised variational objective that maximizes an evidence lower bound without requiring labeled data or explicit system models. On the Lorenz attractor with partial observations, the approach yields lower tracking error than standard baselines and a tighter bound on the posterior. This suggests the learned distributions produce particle sets that better represent the true filtering distribution.
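
To make the training signal concrete, here is a minimal sketch, not the paper's implementation. It assumes a toy one-dimensional linear-Gaussian model (the values of `a`, `q_std`, and `r_std` are illustrative), fixed Gaussian proposals in place of the neural networks the paper uses, and plain multinomial resampling. The function name `smc_log_evidence` and the `propose` callback are hypothetical. The returned value is the log of the particle filter's evidence estimate; its expectation is a lower bound on the log evidence, which is the kind of quantity a variational SMC objective maximizes.

```python
import numpy as np

def log_normal(x, mean, std):
    """Log density of N(mean, std^2) evaluated at x (elementwise)."""
    return -0.5 * np.log(2 * np.pi * std**2) - 0.5 * ((x - mean) / std) ** 2

def logsumexp(a):
    """Numerically stable log(sum(exp(a))) for a 1-D array."""
    m = np.max(a)
    return m + np.log(np.sum(np.exp(a - m)))

def smc_log_evidence(ys, n_particles, propose, rng, a=0.9, q_std=0.5, r_std=0.5):
    """Particle filter on the assumed toy model
        x_t = a * x_{t-1} + N(0, q_std^2),   y_t = x_t + N(0, r_std^2).
    `propose(x_prev, y)` returns (mean, std) of a Gaussian proposal.
    Returns log Z_hat, the log of the SMC evidence estimate.
    """
    x = rng.normal(0.0, 1.0, size=n_particles)  # assumed N(0, 1) initial state
    log_z = 0.0
    for y in ys:
        mean, std = propose(x, y)
        x_new = mean + std * rng.normal(size=n_particles)  # reparameterized draw
        log_w = (log_normal(x_new, a * x, q_std)    # transition density
                 + log_normal(y, x_new, r_std)      # observation likelihood
                 - log_normal(x_new, mean, std))    # proposal density
        log_z += logsumexp(log_w) - np.log(n_particles)
        w = np.exp(log_w - logsumexp(log_w))        # normalized weights
        x = x_new[rng.choice(n_particles, size=n_particles, p=w)]  # resample
    return log_z

# Simulate observations from the same toy model.
data_rng = np.random.default_rng(0)
x_true, ys = data_rng.normal(), []
for _ in range(50):
    x_true = 0.9 * x_true + 0.5 * data_rng.normal()
    ys.append(x_true + 0.5 * data_rng.normal())
ys = np.array(ys)

# Bootstrap proposal (ignores y) versus a proposal that also uses y.
bootstrap = lambda x_prev, y: (0.9 * x_prev, 0.5)
informed = lambda x_prev, y: (0.5 * (0.9 * x_prev + y), np.sqrt(0.125))

elbo_boot = np.mean([smc_log_evidence(ys, 8, bootstrap, np.random.default_rng(s))
                     for s in range(20)])
elbo_inf = np.mean([smc_log_evidence(ys, 8, informed, np.random.default_rng(s))
                    for s in range(20)])
print(f"bootstrap: {elbo_boot:.3f}   informed: {elbo_inf:.3f}")
```

The `informed` proposal is the locally optimal Gaussian for this toy model, derived by hand for the default parameters. In the setting the paper describes, a trained network would take its place, and the averaged `log_z` would be the value being maximized by gradient ascent.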

Core claim

By parameterizing the proposal and transition kernels of a particle filter with neural networks and optimizing them end-to-end via the variational SMC objective, the filter achieves more accurate state estimates and posterior approximations on high-dimensional chaotic systems such as the Lorenz attractor under partial observations.

What carries the argument

Differentiable particle filter whose proposal and transition distributions are realized by neural networks trained with the unsupervised variational SMC objective.

If this is right

The method can be applied to other nonlinear state-space models where observations are high-dimensional and only partially informative.
Posterior approximation quality improves as measured by the evidence lower bound without needing supervised labels.
End-to-end differentiability allows the filter to be embedded inside larger trainable pipelines for sequential inference.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same unsupervised training loop could be tested on real sensor streams such as video or multi-channel time series where ground-truth states are unavailable.
Replacing hand-designed proposals with learned ones may reduce the particle count needed for a given accuracy level in high-dimensional settings.
The approach opens a route to hybrid filters that combine the learned neural components with known physical constraints on the state dynamics.

Load-bearing premise

Neural networks can be trained to produce useful proposal and transition distributions solely from high-dimensional observations using the variational objective, without labeled trajectories or hand-specified dynamics.

What would settle it

A controlled experiment in which the same neural architecture, when trained on Lorenz data, produces particle-filtered trajectories whose root-mean-square error equals or exceeds that of a bootstrap particle filter with analytically chosen proposals would falsify the performance claim.
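
The comparison described above can be sketched as follows. This is an illustration under stated assumptions, not the paper's experimental protocol: the Lorenz-63 parameters are the standard ones, the integrator is a hand-written RK4 step, the filter observes noisy `x` and `z` coordinates directly rather than images, and the noise levels, particle count, and horizon are arbitrary. The helper names (`bootstrap_pf`, `claim_falsified`) are hypothetical.

```python
import numpy as np

def lorenz_rhs(s, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """Lorenz-63 vector field; `s` has shape (..., 3)."""
    x, y, z = s[..., 0], s[..., 1], s[..., 2]
    return np.stack([sigma * (y - x), x * (rho - z) - y, x * y - beta * z], axis=-1)

def rk4_step(s, dt):
    """One classical Runge-Kutta step of the Lorenz dynamics."""
    k1 = lorenz_rhs(s)
    k2 = lorenz_rhs(s + 0.5 * dt * k1)
    k3 = lorenz_rhs(s + 0.5 * dt * k2)
    k4 = lorenz_rhs(s + dt * k3)
    return s + dt / 6.0 * (k1 + 2 * k2 + 2 * k3 + k4)

def rmse(estimate, truth):
    """Root of the mean squared Euclidean error over time steps."""
    return float(np.sqrt(np.mean(np.sum((estimate - truth) ** 2, axis=-1))))

def bootstrap_pf(ys, n_particles, dt, q_std, r_std, init, rng):
    """Bootstrap particle filter; only x and z are observed (y is hidden)."""
    parts = init + rng.normal(size=(n_particles, 3))
    means = []
    for y in ys:
        # Propagate through the known dynamics plus process noise.
        parts = rk4_step(parts, dt) + q_std * rng.normal(size=parts.shape)
        # Gaussian log-likelihood of the observed coordinates (up to a constant).
        log_w = -0.5 * np.sum((y - parts[:, [0, 2]]) ** 2, axis=1) / r_std**2
        w = np.exp(log_w - log_w.max())
        w /= w.sum()
        means.append(w @ parts)  # weighted posterior-mean estimate
        parts = parts[rng.choice(n_particles, size=n_particles, p=w)]
    return np.array(means)

def claim_falsified(candidate_rmse, baseline_rmse):
    """The criterion in the text: the learned filter is no better than the baseline."""
    return candidate_rmse >= baseline_rmse

rng = np.random.default_rng(0)
dt, T, q_std, r_std = 0.01, 200, 0.1, 1.0
init = np.array([1.0, 1.0, 20.0])
state, truth, ys = init.copy(), [], []
for _ in range(T):
    state = rk4_step(state, dt) + q_std * rng.normal(size=3)
    truth.append(state)
    ys.append(state[[0, 2]] + r_std * rng.normal(size=2))
truth, ys = np.array(truth), np.array(ys)

est = bootstrap_pf(ys, 500, dt, q_std, r_std, init, rng)
baseline_rmse = rmse(est, truth)
print(f"bootstrap PF RMSE: {baseline_rmse:.3f}")
```

A learned filter's RMSE on the same trajectories would be passed to `claim_falsified` alongside `baseline_rmse`. A fair test would also average over many trajectories and seeds rather than the single run shown here.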

Figures

Figures reproduced from arXiv: 2501.05982 by Nir Shlezinger, Ruud J.G. van Sloun, Wessel L. van Nierop.

**Figure 2.** Example images of the Lorenz attractor (in the same position) using …

**Figure 5.** Decomposition of the ELBO for the DPF and the baselines for …

**Figure 4.** Tracking error of the DPF compared to the baseline methods for …

Original abstract

Sequential Monte Carlo (SMC), or particle filtering, is widely used in nonlinear state-space systems, but its performance often suffers from poorly approximated proposal and state-transition distributions. This work introduces a differentiable particle filter that leverages the unsupervised variational SMC objective to parameterize the proposal and transition distributions with a neural network, designed to learn from high-dimensional observations. Experimental results demonstrate that our approach outperforms established baselines in tracking the challenging Lorenz attractor from high-dimensional and partial observations. Furthermore, an evidence lower bound based evaluation indicates that our method offers a more accurate representation of the posterior distribution.
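
As an illustration of what "high-dimensional and partial observations" of a three-dimensional state can look like, the sketch below renders a Lorenz state as a small image. This page does not describe how the paper generates its images, so everything here is an assumption: the function name `render_observation`, the choice to draw only the `x` and `z` coordinates, the coordinate ranges, the Gaussian blob shape, and the noise level are all hypothetical.

```python
import numpy as np

def render_observation(state, size=29, blob_std=1.5, noise_std=0.1, rng=None):
    """Render a 3-D Lorenz state as a size x size image.

    Only x and z are drawn (y is dropped), so the image is a partial
    observation of the state. Ranges, blob width, and noise level are
    illustrative assumptions, not the paper's settings.
    """
    x, _, z = state
    # Map x in [-25, 25] and z in [0, 50] to pixel coordinates.
    col = (x + 25.0) / 50.0 * (size - 1)
    row = (50.0 - z) / 50.0 * (size - 1)
    rows, cols = np.mgrid[0:size, 0:size]
    # Gaussian blob centered on the projected position.
    img = np.exp(-((rows - row) ** 2 + (cols - col) ** 2) / (2 * blob_std**2))
    if rng is not None:
        img = img + noise_std * rng.normal(size=img.shape)  # additive pixel noise
    return img

state = np.array([0.0, 5.0, 25.0])
clean = render_observation(state)
noisy = render_observation(state, rng=np.random.default_rng(0))
peak = np.unravel_index(clean.argmax(), clean.shape)
print(clean.shape, peak, clean.size, "pixels for a 3-dimensional state")
```

Under these assumptions a 3-dimensional state becomes an 841-dimensional observation from which the `y` coordinate cannot be read directly, which is the regime where a hand-specified likelihood is hard to write down.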

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 0 minor

Summary. The paper proposes a differentiable particle filter that uses an unsupervised variational SMC objective to train neural networks parameterizing the proposal and state-transition distributions, enabling learning from high-dimensional observations without labeled data. It claims that this approach outperforms established baselines in tracking the Lorenz attractor under high-dimensional and partial observations, and that an ELBO-based evaluation shows improved posterior approximation.

Significance. If the experimental results hold with proper controls, the method could provide a practical way to improve SMC proposals in high-dimensional settings by leveraging variational objectives, potentially benefiting applications in nonlinear filtering where hand-designed proposals are inadequate. The unsupervised nature is a notable strength if validated.

major comments (1)

Abstract: the central claims of outperformance on the Lorenz attractor and improved ELBO rest on experimental results, but the abstract (and available text) provides no details on experimental setup, baselines, error bars, number of particles, observation dimensions, training protocol, or data exclusion criteria, rendering the claims unverifiable and load-bearing for the contribution.

Simulated Author's Rebuttal

1 response · 0 unresolved

We thank the referee for their review and constructive comment. We address the point on the abstract below and will revise accordingly to strengthen the presentation of our results.

Point-by-point responses

Referee: Abstract: the central claims of outperformance on the Lorenz attractor and improved ELBO rest on experimental results, but the abstract (and available text) provides no details on experimental setup, baselines, error bars, number of particles, observation dimensions, training protocol, or data exclusion criteria, rendering the claims unverifiable and load-bearing for the contribution.

Authors: We agree that the abstract should be more self-contained to allow readers to assess the claims without immediately consulting the full text. In the revised version we will expand the abstract to include the key experimental details: the number of particles, the observation dimensions and partial observation protocol for the Lorenz system, the specific baselines, the use of error bars or multiple runs, and a concise description of the unsupervised training protocol. The full experimental setup, including data generation and exclusion criteria, is already detailed in the experimental section; the revision will ensure the abstract summarizes these elements without altering the manuscript's technical content.

Revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

The abstract describes a differentiable particle filter using an unsupervised variational SMC objective to parameterize proposal and transition distributions via neural networks, with performance claims evaluated on the external Lorenz attractor benchmark and ELBO-based posterior assessment. No equations, derivations, or self-citations are presented that reduce any claimed result to its own inputs by construction. The central claims rest on empirical outperformance against baselines rather than internal self-definition or fitted-input renaming, making the derivation self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 0 invented entities

Review based solely on abstract; full paper details unavailable so ledger entries are minimal and provisional.

axioms (1)

Domain assumption: the variational SMC objective provides a suitable unsupervised training signal for neural-network-parameterized proposal and transition distributions.
Invoked as the core training mechanism in the abstract.
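
For orientation, the objective this assumption refers to can be written as below. This is the standard form from the variational SMC literature, in notation chosen here rather than taken from the paper: $N$ particles, transition $p_\theta$, proposal $q_\phi$, ancestor indices $a_{t-1}^{i}$, and resampling assumed at every step. At $t = 1$ the conditioning on the previous state is replaced by the initial distribution. Whether the paper's observation likelihood is fixed or learned is not stated on this page, so it is written without parameters.

```latex
\begin{aligned}
w_t^{i} &= \frac{p_\theta\!\left(x_t^{i} \mid x_{t-1}^{a_{t-1}^{i}}\right)\, p\!\left(y_t \mid x_t^{i}\right)}
                {q_\phi\!\left(x_t^{i} \mid x_{t-1}^{a_{t-1}^{i}},\, y_t\right)}, \\
\hat{Z}_N &= \prod_{t=1}^{T} \frac{1}{N} \sum_{i=1}^{N} w_t^{i}, \\
\mathcal{L}(\theta, \phi) &= \mathbb{E}\!\left[\log \hat{Z}_N\right]
  \;\le\; \log \mathbb{E}\!\left[\hat{Z}_N\right] = \log p_\theta(y_{1:T}).
\end{aligned}
```

The inequality is Jensen's, and the final equality uses the unbiasedness of the SMC evidence estimate $\hat{Z}_N$. Because $\mathcal{L}$ depends only on the observations $y_{1:T}$, maximizing it needs no ground-truth states, which is what makes the training signal unsupervised.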
