A framework to identify and convert foldable layer normalizations to RMSNorm for exact equivalence and faster inference in deep neural networks.
Minimum width for universal approximation
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 1polarities
background 1representative citing papers
Universal Differential Equations unify scientific models with machine learning by embedding flexible approximators into differential equations, enabling applications from biological mechanism discovery to high-dimensional optimization.
Autoencoders enable nonlinear dimensionality reduction for parametric ODEs, with analysis of exact representation properties and convergence of the reduced model to the original.
citing papers explorer
-
Enjoy Your Layer Normalization with the Computational Efficiency of RMSNorm
A framework to identify and convert foldable layer normalizations to RMSNorm for exact equivalence and faster inference in deep neural networks.
-
Universal Differential Equations for Scientific Machine Learning
Universal Differential Equations unify scientific models with machine learning by embedding flexible approximators into differential equations, enabling applications from biological mechanism discovery to high-dimensional optimization.
-
Model reduction of parametric ordinary differential equations via autoencoders: representation properties and convergence analysis
Autoencoders enable nonlinear dimensionality reduction for parametric ODEs, with analysis of exact representation properties and convergence of the reduced model to the original.