Inf-SSM constrains the infinite-horizon evolution of SSMs via Grassmannian geometry and an efficient O(n^2) Sylvester solver to enable exemplar-free continual learning with reduced forgetting.
Ishaan Watts, Catherine Li, Sachin Goyal, Jacob Mitchell Springer, and Aditi Raghunathan
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3representative citing papers
Learning rate decay during SFT increases pretrained model sharpness, which exacerbates catastrophic forgetting and causes overtraining in LLMs.
Proposes the CBDT framework as a minimum viable digital twin for CI builds to enable real-time monitoring, ML modeling, and prescriptive optimization of build duration, failures, and flakiness.
citing papers explorer
-
Exemplar-Free Continual Learning for State Space Models
Inf-SSM constrains the infinite-horizon evolution of SSMs via Grassmannian geometry and an efficient O(n^2) Sylvester solver to enable exemplar-free continual learning with reduced forgetting.
-
(How) Learning Rates Regulate Catastrophic Overtraining
Learning rate decay during SFT increases pretrained model sharpness, which exacerbates catastrophic forgetting and causes overtraining in LLMs.
-
Towards Build Optimization Using Digital Twins
Proposes the CBDT framework as a minimum viable digital twin for CI builds to enable real-time monitoring, ML modeling, and prescriptive optimization of build duration, failures, and flakiness.