Learning Discriminators for Resampling in the Ensemble Gaussian Mixture Filter through a Normalizing Flow Approach
Pith reviewed 2026-05-09 19:05 UTC · model grok-4.3
The pith
A normalizing flow discriminator filters out unrealistic particles during resampling to lower error in the ensemble Gaussian mixture filter when ensembles are small.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The discriminator-informed resampling procedure augments the posterior resampling step of the ensemble Gaussian mixture filter with a discriminator that accepts or rejects candidate particles based on physical plausibility. These discriminators are trained through a normalizing flow approach. On the Ikeda map and Lorenz 63 system the modified procedure produces consistently lower error than the standard ensemble Gaussian mixture filter in low-ensemble regimes.
What carries the argument
The discriminator-informed resampling step, in which a normalizing flow model serves as a learned gate that accepts only physically plausible particles.
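The gating idea above can be illustrated with a minimal sketch. It is not the paper's implementation: the scalar Gaussian mixture, the function names (`sample_mixture`, `discriminator_informed_resample`, `toy_log_density`, `accept`), the threshold rule, and the retry cap are all hypothetical stand-ins, and a fixed standard-normal log-density is used in place of a trained normalizing flow.

```python
import math
import random


def sample_mixture(means, std, weights, rng):
    """Draw one candidate from a 1-D Gaussian mixture with a shared std."""
    mean = rng.choices(means, weights=weights, k=1)[0]
    return rng.gauss(mean, std)


def discriminator_informed_resample(means, std, weights, accept, n, rng,
                                    max_tries=10_000):
    """Draw n particles, keeping only candidates the discriminator accepts."""
    particles = []
    tries = 0
    while len(particles) < n:
        if tries >= max_tries:
            raise RuntimeError("discriminator rejected too many candidates")
        tries += 1
        candidate = sample_mixture(means, std, weights, rng)
        if accept(candidate):
            particles.append(candidate)
    return particles


def toy_log_density(x):
    """Assumption: stand-in for a trained flow's log-density (standard normal)."""
    return -0.5 * x * x - 0.5 * math.log(2.0 * math.pi)


# Assumption: accept a candidate when its log-density clears a fixed threshold.
# With this toy density the rule is equivalent to |x| <= 2.
LOG_DENSITY_THRESHOLD = toy_log_density(2.0)


def accept(x):
    return toy_log_density(x) >= LOG_DENSITY_THRESHOLD


rng = random.Random(0)
particles = discriminator_informed_resample(
    means=[-1.0, 0.5, 3.0], std=0.8, weights=[0.3, 0.5, 0.2],
    accept=accept, n=20, rng=rng,
)
```

The retry cap is a design choice of this sketch: if the discriminator rejects nearly every candidate, the loop fails loudly instead of spinning, which is one place where a learned gate could interact badly with a poor mixture approximation.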
If this is right
- Unrealistic posterior samples are rejected before they enter the forecast step.
- Filtering error decreases relative to the plain EnGMF when the ensemble size is small.
- The improvement appears on both the Ikeda map and the Lorenz 63 system.
- The base filter's convergence properties remain unchanged while resampling quality improves.
Where Pith is reading between the lines
- The same learned discriminator could be reused across multiple forecast cycles if the underlying dynamics stay stationary.
- Extending the discriminator to other particle filters facing similar realism problems would require only retraining the flow on the new system.
- If the flow overfits to training trajectories, long-term forecast skill may degrade even when short-term error drops.
Load-bearing premise
A normalizing flow trained on the target system can distinguish plausible particles from implausible ones without introducing systematic bias into the filter posterior.
What would settle it
Running the discriminator-informed procedure on the Lorenz 63 system with small ensembles and finding that the error does not decrease relative to the standard EnGMF would falsify the central claim.
Original abstract
The ensemble Gaussian mixture filter (EnGMF) is a powerful, convergent particle filter capable of medium-to-high dimensional non-linear filtering. The EnGMF relies on a resampling step that can generate physically unrealistic posterior samples, which would subsequently produce physically meaningless forecasts. This work introduces the discriminator-informed resampling procedure, which augments the posterior resampling step with a discriminator that accepts or rejects candidate particles based on their physical plausibility. In this work these discriminators are learned through a normalizing flow approach. Numerical experiments on both the Ikeda map and the Lorenz '63 system show that the discriminator-informed resampling procedure consistently reduces error relative to the standard EnGMF in low-ensemble regimes.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces a discriminator-informed resampling procedure for the Ensemble Gaussian Mixture Filter (EnGMF), where a discriminator learned via normalizing flows accepts or rejects candidate particles based on physical plausibility. This is intended to prevent the generation of physically unrealistic posterior samples during resampling. Numerical experiments on the Ikeda map and Lorenz '63 system demonstrate that this procedure consistently reduces error relative to the standard EnGMF in low-ensemble regimes.
Significance. If the method maintains the unbiasedness and convergence properties of the EnGMF while effectively filtering out implausible particles, it represents a promising hybrid approach combining data-driven discriminators with traditional particle filters. This could have significant implications for improving the performance of ensemble-based data assimilation in nonlinear dynamical systems, particularly when ensemble sizes are limited due to computational constraints.
major comments (2)
- [Numerical experiments] The claim of consistent error reduction is not supported by any quantitative metrics, specific ensemble sizes, training details for the normalizing flow, or statistical tests. Without these, it is impossible to evaluate the practical significance or reproducibility of the reported improvements on the Ikeda map and Lorenz '63 system.
- [Theoretical analysis] The manuscript does not provide a proof or argument that the acceptance/rejection step using the learned discriminator preserves the posterior distribution or the convergence guarantees of the original EnGMF. This is load-bearing because any systematic bias introduced by the discriminator could invalidate the filter's theoretical properties and explain the observed error reduction as an artifact rather than a true improvement.
minor comments (1)
- [Abstract] The abstract could benefit from a brief mention of the specific normalizing flow architecture used or the training procedure to give readers a better sense of the method's implementation.
Simulated Author's Rebuttal
We thank the referee for their detailed review and constructive suggestions. We address each major comment below and outline the revisions we will make to the manuscript.
- Referee: [Numerical experiments] The claim of consistent error reduction is not supported by any quantitative metrics, specific ensemble sizes, training details for the normalizing flow, or statistical tests. Without these, it is impossible to evaluate the practical significance or reproducibility of the reported improvements on the Ikeda map and Lorenz '63 system.
Authors: We agree that additional quantitative details would strengthen the presentation. In the revised version, we will add a table summarizing the root mean square error (RMSE) reductions for specific ensemble sizes (N=10, 20, 50) on both the Ikeda map and Lorenz '63 systems. We will also include details on the normalizing flow architecture (e.g., number of layers, hidden units), training procedure (epochs, batch size, optimizer), and results from multiple independent runs with statistical significance tests (e.g., Wilcoxon signed-rank test) to confirm the improvements are consistent and not due to random variation. The current manuscript relies on visual comparisons in the figures, but we will enhance this with the requested metrics. revision: yes
- Referee: [Theoretical analysis] The manuscript does not provide a proof or argument that the acceptance/rejection step using the learned discriminator preserves the posterior distribution or the convergence guarantees of the original EnGMF. This is load-bearing because any systematic bias introduced by the discriminator could invalidate the filter's theoretical properties and explain the observed error reduction as an artifact rather than a true improvement.
Authors: This is a valid concern. The EnGMF resampling generates particles from a Gaussian mixture model approximating the posterior, which can occasionally produce unphysical samples. Our discriminator, trained via normalizing flows to model the distribution of physically plausible states, is intended to reject such outliers. While we do not provide a formal proof that this exactly preserves the posterior (as the discriminator is an approximation), we will add a new subsection in the revised manuscript providing an argument based on the properties of acceptance-rejection sampling: if the discriminator accurately approximates the indicator function for the support of the true posterior, the procedure remains unbiased in the limit of perfect discrimination. We will also discuss the potential for bias in finite-sample cases and note that the empirical improvements suggest the bias, if present, is outweighed by the variance reduction. A full theoretical analysis of convergence is beyond the scope of this work but is planned for future research. revision: partial
- Rigorous proof that the acceptance/rejection step preserves the posterior distribution and convergence guarantees of the EnGMF
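The acceptance-rejection argument sketched in the authors' second response can be written out as a short worked equation. The notation is an assumption of this note, not the paper's: q denotes the EnGMF Gaussian mixture approximation of the posterior, d the discriminator's accept indicator, S the set of physically plausible states, and Z the probability that a candidate is accepted.

```latex
\tilde{q}(x) = \frac{q(x)\, d(x)}{Z},
\qquad
Z = \int q(y)\, d(y)\, \mathrm{d}y ,
\qquad
d(x) = \mathbf{1}_{S}(x)
\;\Longrightarrow\;
\tilde{q}(x) = \frac{q(x)\, \mathbf{1}_{S}(x)}{\int_{S} q(y)\, \mathrm{d}y} .
```

Under these assumptions the accepted particles follow q restricted to S and renormalized, and the expected number of candidates per accepted particle is 1/Z. The step removes mass outside S but leaves relative densities inside S unchanged, so whether the result matches the true posterior still depends on how well q approximates it on S.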
Circularity Check
No significant circularity
The paper augments the EnGMF with a learned discriminator via normalizing flows for resampling and validates the approach through numerical experiments on the Ikeda map and Lorenz '63 system. No derivation chain, equation, or prediction reduces by construction to a fitted input or self-citation; the error-reduction claim rests on external empirical benchmarks rather than tautological redefinition of the method's own outputs. The base EnGMF convergence properties are treated as given from prior literature without the new component being forced by self-reference.