pith. sign in

Simulation as Supervision: Mechanistic Pretraining for Scientific Discovery

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it
abstract

Scientific modeling faces a tradeoff between the interpretability of mechanistic theory and the predictive power of machine learning. While existing hybrid approaches have made progress by incorporating domain knowledge into machine learning methods as functional constraints, they can be limited by a reliance on precise mathematical specifications. When the underlying equations are partially unknown or misspecified, enforcing rigid constraints can introduce bias and hinder a model's ability to learn from data. We introduce Simulation-Grounded Neural Networks (SGNNs), a framework that incorporates scientific theory by using mechanistic simulations as training data for neural networks. By pretraining on diverse synthetic corpora that span multiple model structures and realistic observational noise, SGNNs internalize the underlying dynamics of a system as a structural prior. We evaluated SGNNs across multiple disciplines, including epidemiology, ecology, social science, and chemistry. In forecasting tasks, SGNNs outperformed both standard data-driven baselines and physics-constrained hybrid models. They nearly tripled the forecasting skill of the average CDC models in COVID-19 mortality forecasts and accurately forecasted high-dimensional ecological systems. SGNNs demonstrated robustness to model misspecification, performing well even when trained on data with incorrect assumptions. Our framework also introduces back-to-simulation attribution, a method for mechanistic interpretability that explains real-world dynamics by identifying their most similar counterparts within the simulated corpus. By unifying these techniques into a single framework, we demonstrate that diverse mechanistic simulations can serve as effective training data for robust scientific inference.

fields

cs.AI 1 cs.LG 1

years

2026 1 2025 1

verdicts

UNVERDICTED 2

representative citing papers

In-Context Learning Under Regime Change

cs.LG · 2026-04-18 · unverdicted · novelty 6.0

Transformers can solve in-context change-point detection with model size scaling by knowledge of the shift timing, matching optimal baselines on synthetic data and improving pretrained models on disease and financial forecasting.

citing papers explorer

Showing 2 of 2 citing papers.

  • Mantis: A Foundation Model for Mechanistic Disease Forecasting cs.AI · 2025-08-17 · unverdicted · none · ref 8 · internal anchor

    A foundation model trained only on disease simulations achieves top-ranked forecasting accuracy across 16 diseases and beats all CDC COVID-19 hub models on early unseen pandemic data.

  • In-Context Learning Under Regime Change cs.LG · 2026-04-18 · unverdicted · none · ref 25 · internal anchor

    Transformers can solve in-context change-point detection with model size scaling by knowledge of the shift timing, matching optimal baselines on synthetic data and improving pretrained models on disease and financial forecasting.