FlowMixer: A Depth-Agnostic Neural Architecture for Interpretable Spatiotemporal Forecasting
Pith reviewed 2026-05-22 13:37 UTC · model grok-4.3
The pith
A single constrained layer with non-negative mixing and reversible maps can represent any network depth through composition while extracting interpretable Koopman eigenmodes for forecasting.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
FlowMixer shows that a single layer of constrained non-negative matrix mixing inside a reversible mapping framework possesses a semi-group property, allowing it to mathematically represent arbitrary depths through composition. The same construction produces a Kronecker-Koopman eigenmodes framework that extracts interpretable spatiotemporal patterns and supports direct algebraic manipulation of prediction horizons.
What carries the argument
The semi-group property generated by composing reversible non-negative matrix mixing operations, which encodes arbitrary depth and supplies the Kronecker-Koopman eigenmodes for interpretability.
If this is right
- Depth search is eliminated for this class of spatiotemporal models.
- Prediction horizons can be adjusted algebraically after training without retraining.
- Eigenmodes extracted from the trained model provide direct links to dynamical systems analysis.
- Competitive long-horizon forecasting accuracy is achieved while preserving mathematical interpretability across physical domains.
Where Pith is reading between the lines
- Similar semi-group constructions could be tested in other sequence modeling tasks where depth selection currently dominates compute budgets.
- The eigenmode representation may allow stability analysis or control design directly on the learned operator.
- The reversible mixing framework might extend to settings that require invertible transformations for uncertainty quantification or data assimilation.
Load-bearing premise
Constrained non-negative matrix mixing layers inside reversible mappings can capture the full range of structured spatiotemporal dynamics without needing multiple layers or losing expressivity on complex physical phenomena.
What would settle it
A dataset of turbulent flow or chaotic attractors on which a multi-layer baseline consistently outperforms FlowMixer or on which the extracted eigenmodes fail to align with independently computed dynamical modes of the system.
Figures
read the original abstract
We introduce FlowMixer, a single-layer neural architecture that leverages constrained matrix operations to model structured spatiotemporal patterns with enhanced interpretability. FlowMixer incorporates non-negative matrix mixing layers within a reversible mapping framework - applying transforms before mixing and their inverses afterward. This shape-preserving design enables a Kronecker-Koopman eigenmodes framework that bridges statistical learning with dynamical systems theory, providing interpretable spatiotemporal patterns and facilitating direct algebraic manipulation of prediction horizons without retraining. The architecture's semi-group property enables this single layer to mathematically represent any depth through composition, eliminating depth search entirely. Extensive experiments across diverse domains demonstrate FlowMixer's long-horizon forecasting capabilities while effectively modeling physical phenomena such as chaotic attractors and turbulent flows. Our results achieve performance matching state-of-the-art methods while offering superior interpretability through directly extractable eigenmodes. This work suggests that architectural constraints can simultaneously maintain competitive performance and enhance mathematical interpretability in neural forecasting systems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces FlowMixer, a single-layer neural architecture for spatiotemporal forecasting that employs non-negative matrix mixing layers inside a reversible mapping framework. It proposes a Kronecker-Koopman eigenmodes approach to bridge statistical learning and dynamical systems, enabling interpretable patterns and algebraic manipulation of prediction horizons. The central claim is that the architecture's semi-group property allows this single layer to represent arbitrary depths through composition, eliminating depth hyperparameter search. Experiments across domains are reported to match state-of-the-art performance while providing superior interpretability via directly extractable eigenmodes.
Significance. If the semi-group closure and expressivity preservation hold under the non-negativity and reversibility constraints, the work offers a mathematically grounded alternative to deep networks for forecasting, potentially removing depth tuning while linking to Koopman theory for interpretability. The direct horizon manipulation without retraining would be a practical strength if rigorously shown.
major comments (2)
- [Abstract and Section 3 (FlowMixer Architecture)] The manuscript's central claim (abstract and architecture section) that the semi-group property enables a single layer to mathematically represent any depth via composition rests on an unshown derivation of closure under composition for non-negative mixing matrices combined with reversible pre/post transforms. This is load-bearing for the depth-agnostic assertion, as non-negativity restricts mixing to conical combinations that may not preserve full expressivity for sign-changing or higher-order couplings.
- [Section 4 (Kronecker-Koopman Eigenmodes)] Section 4 (Kronecker-Koopman Eigenmodes): The eigenmodes are presented as bridging statistical learning and dynamical systems, but it is not shown whether they are derived independently of the fitted mixing parameters or extracted post-training from the learned matrices. If the latter, this risks circularity in the interpretability and algebraic manipulation claims.
minor comments (2)
- [Abstract] The abstract states performance matching state-of-the-art but provides no quantitative results, error bars, or ablation details on the depth-agnostic property; moving key metrics to the abstract would improve clarity.
- [Section 3] Notation for the constrained mixing matrix and reversible transforms could be standardized across equations to avoid ambiguity in the composition operation.
Simulated Author's Rebuttal
We thank the referee for their constructive comments. We address each major comment below and note the corresponding revisions.
read point-by-point responses
-
Referee: [Abstract and Section 3 (FlowMixer Architecture)] The manuscript's central claim (abstract and architecture section) that the semi-group property enables a single layer to mathematically represent any depth via composition rests on an unshown derivation of closure under composition for non-negative mixing matrices combined with reversible pre/post transforms. This is load-bearing for the depth-agnostic assertion, as non-negativity restricts mixing to conical combinations that may not preserve full expressivity for sign-changing or higher-order couplings.
Authors: We acknowledge that an explicit derivation of semi-group closure for the non-negative mixing matrices under the reversible pre/post-transform constraints was not provided in the original manuscript. In the revision we will insert a formal proof in Section 3 establishing that the constrained matrices remain closed under composition, thereby supporting the depth-agnostic claim. On expressivity, the reversible transforms expand the representable couplings beyond pure conical combinations; our experiments on chaotic and turbulent systems indicate that the resulting expressivity is adequate for the forecasting tasks considered. We will add a short discussion of this limitation and its empirical mitigation. revision: yes
-
Referee: [Section 4 (Kronecker-Koopman Eigenmodes)] Section 4 (Kronecker-Koopman Eigenmodes): The eigenmodes are presented as bridging statistical learning and dynamical systems, but it is not shown whether they are derived independently of the fitted mixing parameters or extracted post-training from the learned matrices. If the latter, this risks circularity in the interpretability and algebraic manipulation claims.
Authors: The eigenmodes are extracted after training from the learned mixing matrices via the Kronecker-Koopman construction in Section 4. This procedure is not circular: the parameters are optimized to minimize forecasting loss, after which the algebraic semi-group structure permits direct eigenmode manipulation for arbitrary horizons. We will revise Section 4 to state the extraction timing explicitly and to separate the training objective from the post-hoc algebraic interpretation. revision: partial
Circularity Check
No significant circularity; claims follow from architectural constraints without reduction to inputs by construction
full rationale
The paper defines FlowMixer via explicit design choices (non-negative matrix mixing inside reversible pre/post transforms) and then derives the semi-group property, Kronecker-Koopman eigenmodes, and depth-agnostic composition as consequences of those constraints. No equation or step is shown to define a quantity in terms of itself or to relabel a fitted parameter as an independent prediction. The algebraic horizon manipulation is presented as a direct result of the eigenmode structure extracted from the learned matrices, but this remains a post-design property rather than a tautological re-expression of the training data. The derivation chain is therefore self-contained against external benchmarks of expressivity and interpretability.
Axiom & Free-Parameter Ledger
free parameters (1)
- mixing matrix entries
axioms (2)
- domain assumption The composition of reversible transforms and matrix mixing forms a semi-group under which a single layer equals arbitrary depth.
- domain assumption Kronecker-Koopman eigenmodes can be directly extracted from the learned architecture to yield interpretable spatiotemporal patterns.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The architecture's semi-group property enables this single layer to mathematically represent any depth through composition... F(F(x, θ, ϕ), θ′, ϕ)=F(x, θ′′, ϕ)
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
constrained non-negative matrix mixing layers within a reversible mapping framework... Wt=αe(W0×W0)∼αI+W×W
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Jaideep Pathak, Brian Hunt, Michelle Girvan, Zhixin Lu, and Edward Ott. Model-free prediction of large spatiotemporally chaotic systems from data: A reservoir computing approach.Physical Review Letters, 120(2):024102, 2018
work page 2018
-
[2]
Ricardo Vinuesa and Steven L Brunton. Enhancing computational fluid dynamics with machine learning.Nature Computational Science, 2(6):358–366, 2022
work page 2022
-
[3]
Moonseop Han and Elizabeth Qian. Robust prediction of dynamical systems with structured neu- ral networks: Long-term behavior and chaos.Physica D: Nonlinear Phenomena, 427:133006, 2021
work page 2021
-
[4]
John Wiley & Sons, Hoboken, New Jersey, 5 edition, 2015
George EP Box, Gwilym M Jenkins, Gregory C Reinsel, and Greta M Ljung.Time Series Analysis: Forecasting and Control. John Wiley & Sons, Hoboken, New Jersey, 5 edition, 2015
work page 2015
-
[5]
Steven L Brunton and J Nathan Kutz. Data-driven science and engineering: Machine learning, dynamical systems, and control.Cambridge University Press, 2019
work page 2019
-
[6]
Steven L Brunton and J Nathan Kutz. Promising directions of machine learning for partial differential equations.Nature Computational Science, 4(7):483–494, 2024
work page 2024
-
[7]
Next generation reservoir computing.Nature communications, 12(1):1–8, 2021
Daniel J Gauthier, Erik Bollt, Aaron Griffith, and Wendson AS Barbosa. Next generation reservoir computing.Nature communications, 12(1):1–8, 2021
work page 2021
-
[8]
Xin Li, Qunxi Zhu, Chengli Zhao, Xiaojun Duan, Bolin Zhao, Xue Zhang, Huanfei Ma, Jie Sun, and Wei Lin. Higher-order granger reservoir computing: simultaneously achieving scalable complex structures inference and accurate dynamics prediction.Nature Communications, 15(1):2506, 2024
work page 2024
-
[9]
Chao Du, Fuxi Cai, Mohammed A Zidan, Wen Ma, Seung Hwan Lee, and Wei D Lu. Reservoir computing using dynamic memristors for temporal information processing.Nature communica- tions, 8(1):2204, 2017
work page 2017
-
[10]
Filippo Maria Bianchi, Simone Scardapane, Sigurd Løkse, and Robert Jenssen. Reservoir computing approaches for representation and classification of multivariate time series.IEEE transactions on neural networks and learning systems, 32(5):2169–2179, 2020
work page 2020
-
[11]
Chaos as an interpretable benchmark for forecasting and data-driven modelling
William Gilpin. Chaos as an interpretable benchmark for forecasting and data-driven modelling. arXiv preprint arXiv:2110.05266, 2021
-
[12]
Pantelis R Vlachas, Jaideep Pathak, Brian R Hunt, Themistoklis P Sapsis, Michelle Girvan, Edward Ott, and Petros Koumoutsakos. Backpropagation algorithms and reservoir computing in recurrent neural networks for the forecasting of complex spatiotemporal dynamics.Neural Networks, 126:191–217, 2020
work page 2020
-
[13]
Min Yan, Can Huang, Peter Bienstman, Peter Tino, Wei Lin, and Jie Sun. Emerging opportuni- ties and challenges for the future of reservoir computing.Nature Communications, 15(1):2056, 2024
work page 2056
-
[14]
MIT Press, Cambridge, MA, 2012
Masashi Sugiyama and Motoaki Kawanabe.Machine Learning in Non-Stationary Environments: Introduction to Covariate Shift Adaptation. MIT Press, Cambridge, MA, 2012
work page 2012
-
[15]
Deterministic nonperiodic flow.Journal of atmospheric sciences, 20(2):130– 141, 1963
Edward N Lorenz. Deterministic nonperiodic flow.Journal of atmospheric sciences, 20(2):130– 141, 1963
work page 1963
-
[16]
An equation for continuous chaos.Physics Letters A, 57(5):397–398, 1976
Otto E Rössler. An equation for continuous chaos.Physics Letters A, 57(5):397–398, 1976
work page 1976
-
[17]
Y Aizawa. Chaos and fractals: Proceedings of the first ifip conference on fractals in the fundamental and applied sciences.North-Holland Delta Series, 1994
work page 1994
-
[18]
CRC Press, Boca Raton, 2 edition, 2018
Steven H Strogatz.Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering. CRC Press, Boca Raton, 2 edition, 2018. 10
work page 2018
-
[19]
Nguyen, Phanwadee Sinthong, and Jayant Kalagnanam
Yuqi Nie, Nam H. Nguyen, Phanwadee Sinthong, and Jayant Kalagnanam. A time series is worth 64 words: Long-term forecasting with transformers. InInternational Conference on Learning Representations, 2023
work page 2023
-
[20]
Timemixer++: A general time series pattern machine for universal predictive analysis
Shiyu Wang, Jiawei Li, Xiaoming Shi, Qinghe Liao, Mengan Liu, Qi Chen, Qingsong Wen, Zhiyuan Wang, Zijie Liu, Xiuwen Zhang, and Rong Jin. Timemixer++: A general time series pattern machine for universal predictive analysis. InarXiv preprint, 2024
work page 2024
-
[21]
Wpmixer: Efficient multi-resolution mixing for long-term time series forecasting
Md Mahmuddun Nabi Murad, Mehmet Aktukmak, and Yasin Yilmaz. Wpmixer: Efficient multi-resolution mixing for long-term time series forecasting. InProceedings of the AAAI Conference on Artificial Intelligence, 2025
work page 2025
-
[22]
Chimera: Effectively modeling multi- variate time series with 2-dimensional state space models
Ali Behrouz, Michele Santacatterina, and Ramin Zabih. Chimera: Effectively modeling multi- variate time series with 2-dimensional state space models. InAdvances in Neural Information Processing Systems, 2024
work page 2024
-
[23]
William Gilpin. Model scale versus domain knowledge in statistical forecasting of chaotic systems.Physical Review Research, 5(4):043252, 2023
work page 2023
-
[24]
Time-llm: Time series forecasting by reprogramming large language models
Ming Jin, Shiyu Wang, Lintao Ma, Zhixuan Zhang, Pin Tang, Hao Cheng, Yong Luo, and Shuicheng Yan. Time-llm: Time series forecasting by reprogramming large language models. InInternational Conference on Learning Representations, 2024
work page 2024
-
[25]
Zero-shot forecasting of chaotic systems.arXiv preprint arXiv:2409.15771, 2024
Yuanzhao Zhang and William Gilpin. Zero-shot forecasting of chaotic systems.arXiv preprint arXiv:2409.15771, 2024
- [26]
-
[27]
Peter J Schmid. Dynamic mode decomposition of numerical and experimental data.Journal of fluid mechanics, 656:5–28, 2010
work page 2010
-
[28]
Si-An Chen, Chun-Liang Li, Nathanael C Yoder, Sercan Ö Arık, and Tomas Pfister. Tsmixer: An all-mlp architecture for time series forecasting.arXiv preprint arXiv:2303.06053, 2023
-
[29]
Reversible instance normalization for accurate time-series forecasting against distribution shift
Taesu Kim, Jinhee Kim, Yunwon Seo, Jae Won Soh, and Jaegul Yoon. Reversible instance normalization for accurate time-series forecasting against distribution shift. InInternational Conference on Learning Representations, 2022
work page 2022
-
[30]
Attention is all you need.Advances in neural information processing systems, 30, 2017
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need.Advances in neural information processing systems, 30, 2017
work page 2017
-
[31]
Clive W.J. Granger and Roselyne Joyeux. An introduction to long-memory time series models and fractional differencing.Journal of Time Series Analysis, 1(1):15–29, 1980
work page 1980
-
[32]
J Nathan Kutz, Steven L Brunton, Bingni W Brunton, and Joshua L Proctor.Dynamic Mode Decomposition: Data-Driven Modeling of Complex Systems, volume 149 ofOther Titles in Applied Mathematics. SIAM, Philadelphia, 2016
work page 2016
-
[33]
Hassan Arbabi and Igor Mezic. Ergodic theory, dynamic mode decomposition, and computation of spectral properties of the koopman operator.SIAM Journal on Applied Dynamical Systems, 16(4):2096–2126, 2017
work page 2096
-
[34]
Chaos as an intermittently forced linear system.Nature communications, 8(1):19, 2017
Steven L Brunton, Bingni W Brunton, Joshua L Proctor, Eurika Kaiser, and J Nathan Kutz. Chaos as an intermittently forced linear system.Nature communications, 8(1):19, 2017
work page 2017
-
[35]
Bernard O Koopman. Hamiltonian systems and transformation in hilbert space.Proceedings of the National Academy of Sciences, 17(5):315–318, 1931
work page 1931
-
[36]
Steven L. Brunton, J. Nathan Kutz, and Joshua L. Proctor.Modern Koopman Theory for Dynamical Systems. Cambridge University Press, Cambridge, UK, 2022. 11
work page 2022
-
[37]
Peter J Baddoo, Benjamin Herrmann, Beverley J McKeon, J Nathan Kutz, and Steven L Brunton. Physics-informed dynamic mode decomposition.Proceedings of the Royal Society A, 479(2271):20220576, 2023
work page 2023
-
[38]
Modeling long-and short-term temporal patterns with deep neural networks
Guokun Lai, Wei-Cheng Chang, Yiming Yang, and Hanxiao Liu. Modeling long-and short-term temporal patterns with deep neural networks. InThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pages 95–104, 2018
work page 2018
-
[39]
Mshyper: Multi-scale hyper- graph transformer for long-range time series forecasting
Zongjiang Shang, Ling Chen, Binqing Wu, and Dongliang Cui. Mshyper: Multi-scale hyper- graph transformer for long-range time series forecasting. InarXiv preprint, 2024
work page 2024
-
[40]
Ada-mshyper: Adaptive multi- scale hypergraph transformer for time series forecasting
Zongjiang Shang, Ling Chen, Binqing Wu, and Dongliang Cui. Ada-mshyper: Adaptive multi- scale hypergraph transformer for time series forecasting. InAdvances in Neural Information Processing Systems, 2024
work page 2024
-
[41]
Shengsheng Lin, Weiwei Lin, Xinyi Hu, Wentai Wu, Ruichao Mo, and Haocheng Zhong. Cyclenet: enhancing time series forecasting through modeling periodic patterns.Advances in Neural Information Processing Systems, 37:106315–106345, 2024
work page 2024
-
[42]
Informer: Beyond efficient transformer for long sequence time-series forecasting
Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, and Wancai Zhang. Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, pages 11106–11115, 2021
work page 2021
-
[43]
Springer, Berlin, Heidelberg, 2013
Martin Treiber and Arne Kesting.Traffic Flow Dynamics: Data, Models and Simulation. Springer, Berlin, Heidelberg, 2013
work page 2013
-
[44]
Bart De Schutter and Serge P Hoogendoorn. Modeling and control of freeway traffic flow by state space neural networks.Neural Computing and Applications, 17(2):175–185, 2008
work page 2008
-
[45]
Chuhan Yang, Fares B Mehouachi, Monica Menendez, and Saif Eddin Jabari. Urban traffic analysis and forecasting through shared koopman eigenmodes.arXiv preprint arXiv:2409.04728, 2024
-
[46]
Matteo Sangiorgio, Fabio Dercole, and Giorgio Guariso. Forecasting of noisy chaotic systems with deep neural networks.Chaos, Solitons & Fractals, 153:111570, 2021
work page 2021
-
[47]
Sanghyeon Choi, Jaeho Shin, Gwanyeong Park, Jung Sun Eo, Jingon Jang, J Joshua Yang, and Gunuk Wang. 3d-integrated multilayered physical reservoir array for learning and forecasting time-series information.Nature Communications, 15(1):2044, 2024
work page 2044
-
[48]
Deep learning.Nature, 521(7553):436–444, 2015
Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. Deep learning.Nature, 521(7553):436–444, 2015
work page 2015
-
[49]
Jason A Platt, Alexander S Wong, Ryan Clark, Stephen G Penny, and Henry DI Abarbanel. Robust forecasting using predictive generalized synchronization in reservoir computing.Chaos: An Interdisciplinary Journal of Nonlinear Science, 31(12):123118, 2021
work page 2021
-
[50]
Jaideep Pathak, Brian R Hunt, Georg M Goerg, and Themistoklis P Sapsis. Data-driven prediction of chaotic dynamics: Methods, challenges, and opportunities.Annual Review of Condensed Matter Physics, 14:379–401, 2023
work page 2023
-
[51]
Boris N Oreshkin, Dmitri Carpov, Nicolas Chapados, and Yoshua Bengio. N-beats: Neural basis expansion analysis for interpretable time series forecasting.arXiv preprint arXiv:2002.05746, 2020
-
[52]
Machine learning for fluid mechanics.Annual Review of Fluid Mechanics, 52:477–508, 2020
Steven L Brunton, Bernd R Noack, and Petros Koumoutsakos. Machine learning for fluid mechanics.Annual Review of Fluid Mechanics, 52:477–508, 2020
work page 2020
-
[53]
Maziar Raissi, Paris Perdikaris, and George E Karniadakis. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations.Journal of Computational Physics, 378:686–707, 2019
work page 2019
-
[54]
Physics-informed machine learning.Nature Reviews Physics, 3(6):422–440, 2021
George Em Karniadakis, Ioannis G Kevrekidis, Lu Lu, Paris Perdikaris, Sifan Wang, and Liu Yang. Physics-informed machine learning.Nature Reviews Physics, 3(6):422–440, 2021. 12
work page 2021
-
[55]
Sifan Wang, Xinling Yu, and Paris Perdikaris. When and why pinns fail to train: A neural tangent kernel perspective.Journal of Computational Physics, 449:110768, 2022
work page 2022
-
[56]
Maziar Raissi and George Em Karniadakis. Hidden physics models: Machine learning of nonlinear partial differential equations.Journal of Computational Physics, 357:125–141, 2020
work page 2020
-
[57]
Towards physics- informed deep learning for turbulent flow prediction
Rui Wang, Karthik Kashinath, Mustafa Mustafa, Adrian Albert, and Rose Yu. Towards physics- informed deep learning for turbulent flow prediction. InProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1457–1466. ACM, 2020
work page 2020
-
[58]
Alejandro Güemes, Stefano Discetti, and Andrea Ianiro. Coarse-grained physics-based predic- tion of three-dimensional unsteady flows via neural networks.Science Advances, 7(46):eabj0751, 2021
work page 2021
-
[59]
Arvind T Mohan, Nicholas Lubbers, Daniel Livescu, and Michael Chertkov. Embedding hard physical constraints in neural network coarse-graining of 3d turbulence.arXiv preprint arXiv:2002.00021, 2020
-
[60]
Sifan Wang, Shyam Sankaran, and Paris Perdikaris. Learning the solution operator of para- metric partial differential equations with physics-informed deeponets.Science Advances, 7(40):eabi8605, 2021
work page 2021
-
[61]
Tom Beucler, Michael Pritchard, Stephan Rasp, Jordan Ott, Pierre Baldi, and Pierre Gentine. Enforcing analytic constraints in neural networks emulating physical systems.Physical Review Letters, 126(9):098302, 2021
work page 2021
-
[62]
N-hits: Neural hierarchical interpolation for time series forecasting
Cristian Challu, Kin G Olivares, Boris N Oreshkin, Federico Garza, Max Mergenthaler-Canseco, and Artur Dubrawski. N-hits: Neural hierarchical interpolation for time series forecasting. arXiv preprint arXiv:2201.12886, 2022
-
[63]
Tian Zhou, Ziqing Ma, Qingsong Wen, Xue Wang, Liang Sun, and Rong Jin. Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting.In International Conference on Machine Learning, pages 26501–26518, 2022
work page 2022
-
[64]
Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting
Haixu Wu, Jiehui Xu, Jianmin Wang, and Mingsheng Long. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. InAdvances in Neural Information Processing Systems, volume 34, pages 22419–22430, 2021
work page 2021
-
[65]
Ailing Zeng, Muxi Chen, Lei Zhang, and Qiang Xu. Are transformers effective for time series forecasting? InProceedings of the AAAI conference on artificial intelligence, volume 37, pages 11121–11128, 2023
work page 2023
-
[66]
Yong Liu, Chenyu Li, Jianmin Wang, and Mingsheng Long. Koopa: Learning non-stationary time series dynamics with koopman predictors.Advances in neural information processing systems, 36:12271–12290, 2023
work page 2023
-
[67]
Qi Rose Yu, Ray Wang, Sercan Arik, and Yihe Dong. Koopman neural forecaster for time-series with temporal distribution shifts.Proceedings of ICLR, 2, 2023
work page 2023
-
[68]
Exact stochastic newton method for deep learning: the feedforward networks case
Fares B Mehouachi and Chaouki Kasmi. Exact stochastic newton method for deep learning: the feedforward networks case. 2022
work page 2022
-
[69]
Exact stochastic second order deep learning.arXiv preprint arXiv:2104.03804, 2021
Fares B Mehouachi and Chaouki Kasmi. Exact stochastic second order deep learning.arXiv preprint arXiv:2104.03804, 2021
-
[70]
Fourier Neural Operator for Parametric Partial Differential Equations
Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Fourier neural operator for parametric partial differen- tial equations.arXiv preprint arXiv:2010.08895, 2020
work page internal anchor Pith review Pith/arXiv arXiv 2010
-
[71]
Fares B Mehouachi, Qingjie Yang, Juan Galvis, Santiago Morales, Milosch Meriac, Felix Vega, and Chaouki Kasmi. Detection of uavs based on spectrum monitoring and deep learning in negative snr conditions.URSI Radio Sci. Lett., 3:43, 2021. 13
work page 2021
-
[72]
F Mehouachi, Juan Galvis, Santiago Morales, Milosch Meriac, Felix Vega, and Chaouki Kasmi. Detection and identification of uavs based on spectrum monitoring and deep learning in negative snr conditions.URSI GASS, 2021
work page 2021
-
[73]
Fares B Mehouachi and Saif Eddin Jabari. Catastrophic overfitting, entropy gap and participation ratio: A noiseless lp norm solution for fast adversarial training.arXiv preprint arXiv:2505.02360, 2025
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[74]
Adam: A method for stochastic optimization
Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization. InInterna- tional Conference on Learning Representations, 2014
work page 2014
-
[75]
Decoupled Weight Decay Regularization
Ilya Loshchilov and Frank Hutter. Fixing weight decay regularization in adam.arXiv preprint arXiv:1711.05101, 2017. 14 Impact Statement FlowMixer introduces an interpretable approach to spatiotemporal forecasting that bridges statistical learning with dynamical systems theory. By employing constrained matrix operations within a reversible mapping framewor...
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[76]
Automatic Parameter Tuning: FlowMixer learns the optimal α directly from data rather than requiring a priori estimation
-
[77]
Computational Tractability: The first-order approximation maintains efficiency while capturing essential long-memory characteristics Through this mechanism, FlowMixer effectively addresses non-stationarity through a theoretically grounded, differentiable approach inspired by classical time series models. 24 J Chaotic Systems Prediction J.1 Overview We eva...
work page 2018
-
[78]
We first compute an intermediate velocity fieldu ∗ without the pressure gradient: u∗ −u n ∆t =−(u n · ∇)un + 1 Re ∇2un
-
[79]
The viscous terms are treated using an ADI scheme to ensure stability: (I− ∆t 2Re ∂2 ∂x2 )ˆu=u n + ∆t 2Re ∂2un ∂y 2 (I− ∆t 2Re ∂2 ∂y 2 )u∗ = ˆu+ ∆t 2Re ∂2ˆu ∂x2 (29)
-
[80]
The pressure Poisson equation is then solved to enforce incompressibility: ∇2pn+1 = 1 ∆t ∇ ·u ∗ (30)
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.