Geometric Dictionary Learning of Dynamical Systems with Optimal Transport
Pith reviewed 2026-05-20 00:16 UTC · model grok-4.3
The pith
Related dynamical systems lie near a low-dimensional manifold in spectral operator space that a learned dictionary can approximate for compact representations and faster estimation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We posit that related dynamical systems lie near a low-dimensional manifold in spectral operator space. Based on this hypothesis, we introduce DOODL (Dynamical OperatOr Dictionary Learning), a framework that learns a dictionary of characteristic spectral dynamics whose combinations approximate this manifold and yield compact, interpretable embeddings of individual systems. Beyond representation learning, DOODL enables fast and interpretable operator estimation from short and partially observed trajectories by constraining the estimation to the learned operator manifold.
What carries the argument
The DOODL dictionary of spectral operators whose combinations approximate the low-dimensional manifold in operator space.
If this is right
- Compact and interpretable embeddings of individual dynamical systems
- Fast operator estimation from short and partially observed trajectories
- Scaling to complex multiscale regimes such as metastable Langevin dynamics and turbulent plasma simulations
- Capturing characteristic spectral structure that governs long-term behavior rather than merely fitting observed trajectories
- Errors one to two orders of magnitude lower than independent operator estimation in low-data settings
Where Pith is reading between the lines
- The manifold assumption could be tested directly by measuring the intrinsic dimension of spectral operators across families of systems from different domains
- Dictionary learning in operator space might combine with transfer methods to initialize models for new but related dynamics
- The approach suggests examining whether other operator representations, such as Koopman or Perron-Frobenius operators, also admit low-dimensional structure across related systems
- Optimal transport distances between operators could serve as a general tool for aligning dynamics learned in separate experiments
Load-bearing premise
Related dynamical systems lie near a low-dimensional manifold in spectral operator space.
What would settle it
For a collection of related systems, showing that spectral operators cannot be approximated to high accuracy by combinations from a small learned dictionary, so that estimation error does not drop below the level achieved by fitting each system independently.
Figures
read the original abstract
Learning dynamical systems through operator-theoretic representations provides a powerful framework for analyzing complex dynamics, as spectral quantities such as eigenvalues and invariant structures encode characteristic time scales and long-term behavior. However, dynamical operators are typically estimated independently for each system, preventing the discovery of shared structure across related dynamics. To address this limitation, we posit that related dynamical systems lie near a low-dimensional manifold in spectral operator space. Based on this hypothesis, we introduce DOODL (Dynamical OperatOr Dictionary Learning), a framework that learns a dictionary of characteristic spectral dynamics whose combinations approximate this manifold and yield compact, interpretable embeddings of individual systems. Beyond representation learning, DOODL enables fast and interpretable operator estimation from short and partially observed trajectories by constraining the estimation to the learned operator manifold. Experiments on metastable Langevin dynamics and turbulent plasma simulations demonstrate that DOODL scales to highly complex multiscale regimes while capturing characteristic spectral structure governing the dynamics rather than merely fitting trajectories, achieving errors one to two orders of magnitude lower than independent operator estimation methods in challenging low-data regimes.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper posits that related dynamical systems lie near a low-dimensional manifold in spectral operator space. It introduces DOODL, a dictionary-learning framework that learns a dictionary of characteristic spectral dynamics whose combinations approximate this manifold, yielding compact embeddings and enabling constrained, fast operator estimation from short or partially observed trajectories via optimal transport. Experiments on metastable Langevin dynamics and turbulent plasma simulations report one-to-two orders of magnitude error reduction versus independent operator estimation in low-data regimes.
Significance. If the manifold hypothesis and empirical gains hold, the work would offer a principled way to share spectral structure across related dynamical systems, improving sample efficiency and interpretability for multiscale physical models where data are scarce. The geometric dictionary-learning approach with optimal transport is a distinctive contribution that could influence operator-theoretic methods in dynamical systems.
major comments (2)
- [Abstract / Introduction] Abstract and opening of the introduction: the central hypothesis that 'related dynamical systems lie near a low-dimensional manifold in spectral operator space' is stated without derivation, theorem, or even a heuristic argument showing why spectral operators of related systems must concentrate rather than fill higher-dimensional regions. This assumption is load-bearing for the entire DOODL construction, the claimed geometric guarantees, and the superiority over independent estimation.
- [Abstract] Abstract (experimental claims): the reported 'one to two orders of magnitude' error reduction is presented without reference to the precise error metric, baseline implementations, number of independent trials, data-exclusion criteria, or statistical tests. Because the central claim rests on these unexamined results, the strength of the empirical support cannot be assessed from the provided description.
minor comments (2)
- [Abstract] The abstract mentions 'optimal transport' only in the title; a one-sentence description of how OT is used to enforce the geometric structure would improve readability.
- [Method section] Notation for the spectral operator space and the dictionary atoms is introduced without an early equation or diagram; a small schematic in §2 would clarify the embedding and reconstruction steps.
Simulated Author's Rebuttal
We thank the referee for their constructive and insightful comments. These have helped us identify areas where the manuscript can be strengthened in terms of motivation and clarity. We address each major comment point by point below, indicating the revisions we will make.
read point-by-point responses
-
Referee: [Abstract / Introduction] Abstract and opening of the introduction: the central hypothesis that 'related dynamical systems lie near a low-dimensional manifold in spectral operator space' is stated without derivation, theorem, or even a heuristic argument showing why spectral operators of related systems must concentrate rather than fill higher-dimensional regions. This assumption is load-bearing for the entire DOODL construction, the claimed geometric guarantees, and the superiority over independent estimation.
Authors: We agree that the manifold hypothesis is foundational and would benefit from explicit motivation. While a universal theorem may not exist for arbitrary unrelated systems, we will add a heuristic argument in the revised Introduction. The argument will note that for families of dynamical systems parameterized by a low-dimensional set of physical quantities (e.g., potential parameters in Langevin dynamics or forcing amplitudes in plasma models), the associated spectral operators vary continuously with these parameters under standard regularity conditions on the underlying stochastic processes. Consequently, the image of this parameter-to-operator map is expected to concentrate near a low-dimensional manifold in operator space. We will also clarify that this is a modeling assumption tailored to the multiscale physical regimes considered in the paper and is empirically validated by the low-rank dictionary structure recovered in Sections 4 and 5. We will discuss the assumption's scope and limitations for systems that are not continuously related. revision: yes
-
Referee: [Abstract] Abstract (experimental claims): the reported 'one to two orders of magnitude' error reduction is presented without reference to the precise error metric, baseline implementations, number of independent trials, data-exclusion criteria, or statistical tests. Because the central claim rests on these unexamined results, the strength of the empirical support cannot be assessed from the provided description.
Authors: We acknowledge that the abstract is overly concise on the experimental details. In the revised manuscript we will expand the abstract to specify the error metric (relative operator estimation error measured in the Frobenius norm), the baseline methods (independent dynamic mode decomposition and extended DMD), the number of independent trials (20), and the fact that all generated trajectories were retained with no exclusion criteria. Standard deviations across trials are reported in the main-text figures, and statistical significance is assessed via paired t-tests (detailed in the supplementary material). These elements are already present in Sections 4 and 5; the revision will simply surface the key qualifiers in the abstract while respecting length constraints. revision: yes
Circularity Check
No circularity: hypothesis explicitly posited as assumption with independent construction
full rationale
The paper explicitly introduces the low-dimensional manifold claim with the phrasing 'we posit that related dynamical systems lie near a low-dimensional manifold in spectral operator space' and then defines DOODL as a dictionary-learning procedure built on top of that assumption. No equations reduce the claimed predictions or embeddings back to fitted parameters by construction, no self-citations are invoked as load-bearing uniqueness theorems, and no ansatz is smuggled in via prior work. The derivation chain is therefore self-contained once the hypothesis is granted; the method's outputs are not equivalent to its inputs by definition.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Related dynamical systems lie near a low-dimensional manifold in spectral operator space.
invented entities (1)
-
Dictionary of characteristic spectral dynamics
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction contradicts?
contradictsCONTRADICTS: the theorem conflicts with this paper passage, or marks a claim that would need revision before publication.
We posit that related dynamical systems lie near a low-dimensional manifold in spectral operator space. Based on this hypothesis, we introduce DOODL...
-
IndisputableMonolith/Foundation/AlphaCoordinateFixation.leanJ_uniquely_calibrated_via_higher_derivative unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The reconstruction is: Bp(α; G) ≜ PN(∑ αj Λj, ∑ αj Lj, ∑ αj Rj). ... Riemannian gradient methods on the quotient manifold M
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Princeton University Press, 2008
P-A Absil.Optimization algorithms on matrix manifolds. Princeton University Press, 2008
work page 2008
-
[2]
Projection-like retractions on matrix manifolds.SIAM Journal on Optimization, 22(1):135–158, 2012
P-A Absil and Jérôme Malick. Projection-like retractions on matrix manifolds.SIAM Journal on Optimization, 22(1):135–158, 2012
work page 2012
-
[3]
Distances on spaces of high-dimensional linear stochastic processes: a survey
Bijan Afsari and René Vidal. Distances on spaces of high-dimensional linear stochastic processes: a survey. InGeometric Theory of Information, pages 219–242. Springer, 2014
work page 2014
-
[4]
Random fourier features for kernel ridge regression: Approximation bounds and statistical guarantees
Haim Avron, Michael Kapralov, Cameron Musco, Christopher Musco, Ameya Velingker, and Amir Zandieh. Random fourier features for kernel ridge regression: Approximation bounds and statistical guarantees. InInternational conference on machine learning, pages 253–262. PMLR, 2017
work page 2017
-
[5]
Jean-Pierre Boeuf and Andrei Smolyakov. Physics and instabilities of low-temperature E×B plasmas for spacecraft propulsion and other applications.Physics of Plasmas, 30(5):050901, 2023
work page 2023
-
[6]
Bolhuis, David Chandler, Christoph Dellago, and Phillip L
Peter G. Bolhuis, David Chandler, Christoph Dellago, and Phillip L. Geissler. Transition path sampling: Throwing ropes over rough mountain passes, in the dark.Annual Review of Physical Chemistry, 53(V olume 53, 2002):291–318, 2002
work page 2002
-
[7]
Cambridge University Press, 2023
Nicolas Boumal.An introduction to optimization on smooth manifolds. Cambridge University Press, 2023
work page 2023
-
[8]
Random fourier features for operator- valued kernels
Romain Brault, Markus Heinonen, and Florence Buc. Random fourier features for operator- valued kernels. In Robert J. Durrant and Kee-Eung Kim, editors,Proceedings of The 8th Asian Conference on Machine Learning, volume 63 ofProceedings of Machine Learning Research, pages 110–125, The University of Waikato, Hamilton, New Zealand, 16–18 Nov 2016. PMLR
work page 2016
-
[9]
Brunton, Marko Budiši´c, Eurika Kaiser, and J
Steven L. Brunton, Marko Budiši´c, Eurika Kaiser, and J. Nathan Kutz. Modern Koopman theory for dynamical systems.SIAM Review, 64(2):229–340, 2022
work page 2022
-
[10]
Initial-state invariant binet-cauchy kernels for the comparison of linear dynamical systems
Rizwan Chaudhry and René Vidal. Initial-state invariant binet-cauchy kernels for the comparison of linear dynamical systems. In52nd IEEE Conference on Decision and Control, pages 5377–
-
[11]
Gradnorm: Gradient normalization for adaptive loss balancing in deep multitask networks
Zhao Chen, Vijay Badrinarayanan, Chen-Yu Lee, and Andrew Rabinovich. Gradnorm: Gradient normalization for adaptive loss balancing in deep multitask networks. InProceedings of the 35th International Conference on Machine Learning, volume 80 ofProceedings of Machine Learning Research, pages 794–803. PMLR, 2018
work page 2018
-
[12]
Anoop Cherian and Suvrit Sra. Riemannian dictionary learning and sparse coding for positive definite matrices.IEEE transactions on neural networks and learning systems, 28(12):2859– 2871, 2016
work page 2016
-
[13]
Reinart Coosemans, Wouter Dekeyser, and Martine Baelmans. Bayesian analysis of turbulent transport coefficients in 2d interchange dominated ExB turbulence involving flow shear.Journal of Physics: Conference Series, 1785:012001, 2021
work page 2021
-
[14]
Fast computation of wasserstein barycenters
Marco Cuturi and Arnaud Doucet. Fast computation of wasserstein barycenters. InInternational conference on machine learning, pages 685–693. PMLR, 2014
work page 2014
-
[15]
Farbod Faraji, Maryam Reza, Aaron Knoll, and J. Nathan Kutz. Dynamic mode decomposition for data-driven analysis and reduced-order modelling of E×B plasmas: I. extraction of spatiotemporally coherent patterns.Journal of Physics D: Applied Physics, 57:065201, 2024
work page 2024
- [16]
-
[17]
Thibaut Germain, Rémi Flamary, Vladimir R. Kostic, and Karim Lounici. A spectral-grassmann wasserstein metric for operator representations of dynamical systems. 2026
work page 2026
-
[18]
P. Ghendrih, Y . Asahi, E. Caschera, G. Dif-Pradalier, P. Donnel, X. Garbet, C. Gillot, V . Grand- girard, G. Latu, Y . Sarazin, et al. Generation and dynamics of SOL corrugated profiles.Journal of Physics: Conference Series, 1125:012011, 2018
work page 2018
-
[19]
Philippe Ghendrih, Guilhem Dif-Pradalier, Olivier Panico, Yanick Sarazin, Hugo Bufferand, Guido Ciraolo, Peter Donnel, Nicolas Fedorczak, Xavier Garbet, Virginie Grandgirard, et al. Role of avalanche transport in competing drift wave and interchange turbulence.Journal of Physics: Conference Series, 2397:012018, 2022
work page 2022
-
[20]
Optimization on the biorthogonal manifold
Klaus Glashoff and Michael M Bronstein. Optimization on the biorthogonal manifold.arXiv preprint arXiv:1609.04161, 2016
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[21]
Tokam2D: A 2d spectral solver for turbulence schemes
GYSELAX Team. Tokam2D: A 2d spectral solver for turbulence schemes. https://github. com/gyselax/tokam2d, 2026. Accessed: 2026-05-01
work page 2026
-
[22]
Dictionary learning and sparse coding on grassmann manifolds: An extrinsic solution
Mehrtash Harandi, Conrad Sanderson, Chunhua Shen, and Brian C Lovell. Dictionary learning and sparse coding on grassmann manifolds: An extrinsic solution. InProceedings of the IEEE international conference on computer vision, pages 3120–3127, 2013
work page 2013
-
[23]
Sparse coding and dictionary learning for symmetric positive definite matrices: A kernel approach
Mehrtash T Harandi, Conrad Sanderson, Richard Hartley, and Brian C Lovell. Sparse coding and dictionary learning for symmetric positive definite matrices: A kernel approach. InEuropean conference on computer vision, pages 216–229. Springer, 2012
work page 2012
-
[24]
Poseidon: Efficient foundation models for pdes, 2024
Maximilian Herde, Bogdan Raoni ´c, Tobias Rohner, Roger Käppeli, Roberto Molinaro, Em- manuel de Bézenac, and Siddhartha Mishra. Poseidon: Efficient foundation models for pdes, 2024
work page 2024
-
[25]
On a nonlinear generalization of sparse coding and dictionary learning
Jeffrey Ho, Yuchen Xie, and Baba Vemuri. On a nonlinear generalization of sparse coding and dictionary learning. InInternational conference on machine learning, pages 1480–1488. PMLR, 2013
work page 2013
-
[26]
Zhiwu Huang, Luc Van Gool, and Johan A. K. Suykens. Sparse coding and dictionary learning for linear dynamical systems. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 543–550, 2016
work page 2016
-
[27]
A metric on nonlinear dynamical systems with perron–frobenius operators
Issei Ishikawa, Keisuke Fujii, Masahiro Ikeda, Yuka Hashimoto, and Yoshinobu Kawahara. A metric on nonlinear dynamical systems with perron–frobenius operators. InAdvances in Neural Information Processing Systems, volume 31, 2018. 11
work page 2018
-
[28]
Hachem Kadri, Emmanuel Duflos, Philippe Preux, Stéphane Canu, Alain Rakotomamonjy, and Julien Audiffren. Operator-valued kernels for learning from functional response data.Journal of Machine Learning Research, 17(20):1–54, 2016
work page 2016
-
[29]
Dynamic mode decomposition with reproducing kernels for koopman spectral analysis.NeurIPS, 2016
Yoshinobu Kawahara. Dynamic mode decomposition with reproducing kernels for koopman spectral analysis.NeurIPS, 2016
work page 2016
-
[30]
On the numerical approximation of the perron-frobenius and koopman operator
Stefan Klus, Feliks Nüske, Peter Koltai, Hao Wu, Ioannis G Kevrekidis, Christof Schütte, and Frank Noé. On the numerical approximation of the perron-frobenius and koopman operator. Journal of Computational Dynamics, 2018
work page 2018
-
[31]
Bernard O. Koopman. Hamiltonian systems and transformation in hilbert space.Proceedings of the National Academy of Sciences, 17(5):315–318, 1931
work page 1931
-
[32]
V . R. Kostic, P. Novelli, R. Grazzi, K. Lounici, and M. Pontil. Learning invariant representations of time-homogeneous stochastic dynamical systems. InInternational Conference on Learning Representations, 2024
work page 2024
-
[33]
Vladimir Kostic, Karim Lounici, Pietro Novelli, and Massimiliano Pontil. Sharp spectral rates for koopman operator learning.Advances in Neural Information Processing Systems, 36:32328–32339, 2023
work page 2023
-
[34]
Vladimir Kostic, Pietro Novelli, Andreas Maurer, Carlo Ciliberto, Lorenzo Rosasco, and Massi- miliano Pontil. Learning dynamical systems via koopman operator regression in reproducing kernel hilbert spaces.Advances in Neural Information Processing Systems, 35:4017–4031, 2022
work page 2022
-
[35]
Kostic, Karim Lounici, Hélène Halconruy, Timothée Devergne, Pietro Novelli, and Massimiliano Pontil
Vladimir R. Kostic, Karim Lounici, Hélène Halconruy, Timothée Devergne, Pietro Novelli, and Massimiliano Pontil. Laplace transform based low-complexity learning of continuous markov semigroups. InProceedings of the 42nd International Conference on Machine Learning, volume 267 ofProceedings of Machine Learning Research. PMLR, 2025
work page 2025
-
[36]
Kostic, Karim Lounici, Grégoire Pacreau, Giacomo Turri, Pietro Novelli, and Massimiliano Pontil
Vladimir R. Kostic, Karim Lounici, Grégoire Pacreau, Giacomo Turri, Pietro Novelli, and Massimiliano Pontil. Neural conditional probability for uncertainty quantification. InAdvances in Neural Information Processing Systems, 2024
work page 2024
-
[37]
Kostic, Karim Lounici, and Massimiliano Pontil
Vladimir R. Kostic, Karim Lounici, and Massimiliano Pontil. Toeplitz based spectral methods for data-driven dynamical systems, 2026
work page 2026
-
[38]
Kernel methods for koopman operator learning
Vladimir Kosti´c, Jean-Baptiste Fermanian, et al. Kernel methods for koopman operator learning. InNeurIPS, 2022
work page 2022
-
[39]
Samuel Lanthaler and Nicholas H. Nelsen. Error bounds for learning with vector-valued random features. InThirty-seventh Conference on Neural Information Processing Systems, 2023
work page 2023
-
[40]
Mackey.Chaos, Fractals, and Noise, volume 97 ofApplied Mathematical Sciences
Andrzej Lasota and Michael C. Mackey.Chaos, Fractals, and Noise, volume 97 ofApplied Mathematical Sciences. Springer New York, 1994
work page 1994
-
[41]
Log-euclidean kernels for sparse representation and dictionary learning
Peihua Li, Qilong Wang, Wangmeng Zuo, and Lei Zhang. Log-euclidean kernels for sparse representation and dictionary learning. InProceedings of the IEEE international conference on computer vision, pages 1601–1608, 2013
work page 2013
-
[42]
Qianxiao Li, Felix Dietrich, Erik M. Bollt, and Ioannis G. Kevrekidis. Extended dynamic mode decomposition with dictionary learning: a data-driven adaptive spectral decomposition of the koopman operator.Chaos, 27(10):103111, 2017
work page 2017
-
[43]
Physics-informed koop- man network for time-series prediction of dynamical systems
Yuying Liu, Aleksei Sholokhov, Hassan Mansour, and Saleh Nabi. Physics-informed koop- man network for time-series prediction of dynamical systems. InICLR 2024 Workshop on AI4DifferentialEquations In Science, 2024
work page 2024
-
[44]
Julien Mairal, Francis Bach, and Jean Ponce. Task-driven dictionary learning.IEEE transactions on pattern analysis and machine intelligence, 34(4):791–804, 2011. 12
work page 2011
-
[45]
Online dictionary learning for sparse coding
Julien Mairal, Francis Bach, Jean Ponce, and Guillermo Sapiro. Online dictionary learning for sparse coding. InProceedings of the 26th annual international conference on machine learning, pages 689–696, 2009
work page 2009
-
[46]
A metric for arma processes.IEEE transactions on Signal Processing, 48(4):1164–1170, 2002
Richard J Martin. A metric for arma processes.IEEE transactions on Signal Processing, 48(4):1164–1170, 2002
work page 2002
-
[47]
Igor Mezi´c. On comparison of dynamics of dissipative and finite-time systems using koopman operator methods.IFAC-PapersOnLine, 49(18):454–461, 2016
work page 2016
-
[48]
Comparison of systems with complex behavior.Physica D: Nonlinear Phenomena, 197(1-2):101–133, 2004
Igor Mezi´c and Andrzej Banaszuk. Comparison of systems with complex behavior.Physica D: Nonlinear Phenomena, 197(1-2):101–133, 2004
work page 2004
-
[49]
Hà Quang Minh. Operator-valued bochner theorem, Fourier feature maps for operator-valued kernels, and vector-valued learning.arXiv preprint arXiv:1608.05639, 2016
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[50]
Bruno A Olshausen and David J Field. Emergence of simple-cell receptive field properties by learning a sparse code for natural images.Nature, 381(6583):607–609, 1996
work page 1996
-
[51]
Random features for large-scale kernel machines
Ali Rahimi and Benjamin Recht. Random features for large-scale kernel machines. InAdvances in Neural Information Processing Systems, volume 20, 2007
work page 2007
-
[52]
Emmanuel Rio. Covariance inequalities for strongly mixing processes.Annales de l’Institut Henri Poincaré, Probabilités et Statistiques, 29(4):587–597, 1993
work page 1993
- [53]
-
[54]
Morgan A Schmitz, Matthieu Heitz, Nicolas Bonneel, Fred Ngole, David Coeurjolly, Marco Cuturi, Gabriel Peyré, and Jean-Luc Starck. Wasserstein dictionary learning: Optimal transport-based unsupervised nonlinear dictionary learning.SIAM Journal on Imaging Sciences, 11(1):643–678, 2018
work page 2018
-
[55]
Ch. Schütte, W. Huisinga, and P. Deuflhard. Transfer operator approach to conformational dynamics in biomolecular systems. In Bernold Fiedler, editor,Ergodic Theory, Analysis, and Efficient Simulation of Dynamical Systems, pages 191–223, Berlin, Heidelberg, 2001. Springer Berlin Heidelberg
work page 2001
-
[56]
Christof Schütte, Stefan Klus, and Carsten Hartmann. Overcoming the timescale barrier in molecular dynamics: Transfer operators, variational principles and machine learning.Acta Numerica, 32:517–673, 2023
work page 2023
-
[57]
Roy Taylor, J. Nathan Kutz, Kyle D. Morgan, and Brian A. Nelson. Dynamic mode decompo- sition for plasma diagnostics and validation.Review of Scientific Instruments, 89(5):053501, 2018
work page 2018
-
[58]
Dictionary learning.IEEE Signal Processing Magazine, 28(2):27–38, 2011
Ivana Toši´c and Pascal Frossard. Dictionary learning.IEEE Signal Processing Magazine, 28(2):27–38, 2011
work page 2011
-
[59]
Nilesh Tripuraneni, Chi Jin, and Michael I. Jordan. Provable meta-learning of linear representa- tions. InProceedings of the 38th International Conference on Machine Learning, 2021
work page 2021
-
[60]
Hien Van Nguyen, Vishal M Patel, Nasser M Nasrabadi, and Rama Chellappa. Kernel dictionary learning. In2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2021–2024. IEEE, 2012
work page 2021
-
[61]
Hien Van Nguyen, Vishal M Patel, Nasser M Nasrabadi, and Rama Chellappa. Design of non-linear kernel dictionaries for object recognition.IEEE Transactions on Image Processing, 22(12):5123–5135, 2013
work page 2013
-
[62]
S. V . N. Vishwanathan, Alexander J. Smola, and René Vidal. Binet-cauchy kernels on dynamical systems and its application to the analysis of dynamic scenes.International Journal of Computer Vision, 73(1):95–119, 2007
work page 2007
-
[63]
Matthew O Williams, Clarence W Rowley, and Ioannis G Kevrekidis. A kernel-based method for data-driven koopman spectral analysis.Journal of Computational Dynamics, 2015. 13
work page 2015
-
[64]
Hao Wu, Feliks Nüske, Fabian Paul, Stefan Klus, Péter Koltai, and Frank Noé. Variational Koopman models: Slow collective variables and molecular kinetics from short off-equilibrium simulations.The Journal of Chemical Physics, 146(15):154104, 2017. A Operator-Theoretic Foundations A.1 Operator representations and estimation Markov semigroup and transfer op...
work page 2017
-
[65]
Coefficient estimation:fix the dictionary and optimize {αi}i∈[b] via gradient descent (with softmax parametrization). Since the problem is not convex, we use an initialization of based on proximity to dictionary atoms, i.e. αi ∝ {−d S(Gi, Gj)}j∈[d] to start the optimization in a relevant region of the parameter space. In practice we use the Adam optimizer...
-
[66]
We leverage the envelope theorem to ignore implicit gradients through{α i}i∈[b]
Dictionary update:fix the coordinates {αi}i∈[b] and update the dictionary with a Riemannian gradient step on N d from the objective on the batch. We leverage the envelope theorem to ignore implicit gradients through{α i}i∈[b]. Default gradient steps has a learning rate of 1e-2. Algorithm 2 describes the overall optimization procedure. Computation of the d...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.