Reconstructing Noisy Gene Regulation Dynamics Using Extrinsic-Noise-Driven Neural Stochastic Differential Equations
Pith reviewed 2026-05-23 01:06 UTC · model grok-4.3
The pith
An extrinsic-noise-driven neural SDE reconstructs gene regulation dynamics from heterogeneous cell trajectories by matching Wasserstein distances.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central discovery is that driving a neural SDE with an extrinsic noise term and optimizing it to match trajectory distributions via the Wasserstein distance enables accurate reconstruction of noisy gene regulation dynamics, including how extrinsic heterogeneity modulates the intrinsic stochastic reactions, as shown in three cell biology systems.
What carries the argument
The extrinsic-noise-driven neural stochastic differential equation (END-nSDE) framework that augments neural SDEs with extrinsic noise and uses Wasserstein distance for distribution matching.
If this is right
- It models how cellular heterogeneity modulates reaction dynamics alongside intrinsic noise.
- It outperforms RNNs and LSTMs in reconstructing the dynamics from the tested systems.
- By inferring heterogeneities, it reproduces the noisy experimental dynamics.
- It provides a surrogate model for complex biophysical processes where mechanistic models are hard to build.
Where Pith is reading between the lines
- Applying this to other noisy biological systems could help predict how environmental changes affect population-level behavior through altered heterogeneity.
- Combining it with single-cell data might reveal which molecular factors drive the inferred extrinsic noise.
- Validating against systems with independently measured noise sources would test if the separation of intrinsic and extrinsic effects holds.
Load-bearing premise
Trajectory data from heterogeneous cells can be reconstructed accurately by a neural SDE driven by extrinsic noise when distributions are matched using Wasserstein distance, without needing system-specific mechanistic details.
What would settle it
A new experiment on one of the systems where cell heterogeneity is deliberately reduced or increased, showing that the reconstructed model does not adjust its predictions accordingly.
Figures
read the original abstract
Proper regulation of cell signaling and gene expression is crucial for maintaining cellular function, development, and adaptation to environmental changes. Reaction dynamics in cell populations is often noisy because of (i) inherent stochasticity of intracellular biochemical reactions (``intrinsic noise'') and (ii) heterogeneity of cellular states across different cells that are influenced by external factors (``extrinsic noise''). In this work, we introduce an extrinsic-noise-driven neural stochastic differential equation (END-nSDE) framework that utilizes the Wasserstein distance to accurately reconstruct SDEs from trajectory data from a heterogeneous population of cells (extrinsic noise). We demonstrate the effectiveness of our approach using both simulated and experimental data from three different systems in cell biology: (i) circadian rhythms, (ii) RPA-DNA binding dynamics, and (iii) NF$\kappa$B signaling process. Our END-nSDE reconstruction method can model how cellular heterogeneity (extrinsic noise) modulates reaction dynamics in the presence of intrinsic noise. It also outperforms existing time-series analysis methods such as recurrent neural networks (RNNs) and long short-term memory networks (LSTMs). By inferring cellular heterogeneities from data, our END-nSDE reconstruction method can reproduce noisy dynamics observed in experiments. In summary, the reconstruction method we propose offers a useful surrogate modeling approach for complex biophysical processes, where high-fidelity mechanistic models may be impractical.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces an extrinsic-noise-driven neural stochastic differential equation (END-nSDE) framework that trains a neural SDE by driving it with an extrinsic noise process and minimizing Wasserstein distance to observed single-cell trajectories. The central claim is that this reconstructs how cellular heterogeneity modulates intrinsic reaction dynamics in gene regulation, demonstrated on simulated and experimental data from circadian rhythms, RPA-DNA binding, and NFκB signaling, while outperforming RNNs and LSTMs as a surrogate model.
Significance. If the separation of extrinsic modulation from intrinsic noise holds and the learned dynamics are identifiable, the approach could offer a practical data-driven surrogate for noisy biophysical processes where mechanistic models are intractable, extending neural SDE methods to heterogeneous cell populations.
major comments (2)
- [Abstract and §3] Abstract and §3 (Methods): the claim that END-nSDE 'models how cellular heterogeneity modulates reaction dynamics' requires that the extrinsic driver plus learned drift/diffusion isolate modulation effects rather than merely matching marginal trajectory statistics via Wasserstein distance. No identifiability argument, recovery of known modulation parameters, or out-of-distribution test is described for the three cases, leaving the separation assumption untested.
- [§4] §4 (Results, circadian/RPA/NFκB cases): the reported outperformance over RNNs/LSTMs is stated without quantitative metrics, error bars, or ablation on the extrinsic-noise driver component, so it is unclear whether gains arise from the extrinsic driver or from the neural SDE architecture alone.
minor comments (2)
- [§2] Notation for the extrinsic noise process and its coupling to the neural SDE should be defined explicitly with an equation in §2 or §3.
- [Abstract] The abstract states 'reproduce noisy dynamics observed in experiments' but does not specify which experimental observables (e.g., period, amplitude distributions) are matched.
Simulated Author's Rebuttal
We thank the referee for the constructive comments. We address each major comment below.
read point-by-point responses
-
Referee: [Abstract and §3] Abstract and §3 (Methods): the claim that END-nSDE 'models how cellular heterogeneity modulates reaction dynamics' requires that the extrinsic driver plus learned drift/diffusion isolate modulation effects rather than merely matching marginal trajectory statistics via Wasserstein distance. No identifiability argument, recovery of known modulation parameters, or out-of-distribution test is described for the three cases, leaving the separation assumption untested.
Authors: The END-nSDE framework is constructed precisely to achieve this separation: an explicit extrinsic noise process (sampled to reflect observed cellular heterogeneity) is provided as a time-varying driver to the neural SDE, while the neural network learns only the drift and diffusion that govern the underlying reaction dynamics. Minimizing Wasserstein distance then aligns the generated trajectory distributions with those observed in heterogeneous populations. This design ensures the learned components capture modulation effects rather than merely reproducing marginal statistics. Although the manuscript does not contain a formal identifiability proof or out-of-distribution parameter-recovery tests, the consistent performance on both simulated data (where ground-truth modulation is known) and three experimental systems supports the validity of the approach for data-driven reconstruction. We therefore maintain the claim as stated and do not plan to alter it. revision: no
-
Referee: [§4] §4 (Results, circadian/RPA/NFκB cases): the reported outperformance over RNNs/LSTMs is stated without quantitative metrics, error bars, or ablation on the extrinsic-noise driver component, so it is unclear whether gains arise from the extrinsic driver or from the neural SDE architecture alone.
Authors: We agree that §4 would benefit from additional quantitative detail. In the revised manuscript we will report explicit performance metrics (e.g., Wasserstein distances and trajectory prediction errors) with error bars across multiple random seeds, together with an ablation that removes the extrinsic-noise driver while retaining the neural SDE architecture. revision: yes
Circularity Check
No circularity: data-driven neural SDE reconstruction is self-contained
full rationale
The paper presents END-nSDE as a neural SDE framework trained on trajectory data via Wasserstein distance matching to reconstruct dynamics under extrinsic noise. No derivation chain reduces a claimed result to its inputs by construction, no self-citations are load-bearing for the central method, and no fitted parameters are renamed as independent predictions. The approach is a standard surrogate modeling procedure validated on three separate biological cases, remaining self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We introduce an extrinsic-noise-driven neural stochastic differential equation (END-nSDE) framework that utilizes the Wasserstein distance to accurately reconstruct SDEs from trajectory data from a heterogeneous population of cells
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Our END-nSDE reconstruction method can model how cellular heterogeneity (extrinsic noise) modulates reaction dynamics in the presence of intrinsic noise
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Noisy oscillatory circadian clock model Circadian clocks, often with a typical period of approx- imately 24 hours, are ubiquitous in intrinsically noisy bi- ological rhythms generated at the single-cell molecular level [36]. We consider a minimal SDE model of the periodic gene dynamics responsible for per gene expression which is critical in the circadian...
-
[2]
RPA-DNA binding model Regulation of gene expression relies on complex in- teractions between proteins and DNA, often described by the kinetics of binding and dissociation. Replication protein A (RPA) plays a pivotal role in various DNA metabolic pathways, including DNA replication and re- pair, through its dynamic binding with single-stranded DNA (ssDNA) ...
-
[3]
NF κB signaling model Macrophages can sense environmental information and respond accordingly with stimulus-response specificity encoded in signaling pathways and decoded by down- stream gene expression profiles [45]. The temporal dy- namics of NFκB, a key transcription factor in immune response and inflammation, encodes stimulus informa- tion [46]. NF κB...
-
[4]
(10) and established models [46, 51]
Reconstructing a 52-dimensional stochastic model for NFκB dynamics For training END-nSDE models, we first generate syn- thetic data from the 52-dimensional SDE model of NFκB signaling dynamics Eqs. (10) and established models [46, 51]. The synthetic trajectories are generated under 121 combinations of noise intensity(σ1,σ2) in Eqs. (10) (see Appendix D). ...
-
[5]
ground truth ground truth prediction 0 50 100 150 D times (min) prediction vs
Reconstructing NFκB experimental data with END-nSDE We then assess whether our proposed END-nSDE can accurately reconstruct the experimentally measured 10 0 0.1 0.2 0.3 (σ1, σ 2) = (10 −3.2, 10−2.5) A nucleus NFκB activity ground truth trajectories (σ1, σ 2) = (10 −2.2, 10−1.5) B ground truth trajectories 0 50 100 1500 0.1 0.2 0.3 C times (min) nucleus NF...
-
[6]
Weusedexperimentallymeasuredsingle-celltrajec- tories of NFκB activity, obtained through live-cell image tracking of macrophages from mVenus- tagged RelA mouse with a frame frequency of five minutes [55]. These trajectories correspond to the sum of nuclear IκBα−NFκB and NFκB in the 52D SDE model (u5(t) andu10(t) in Eqs. (10))
-
[7]
The experimental dataset was divided into sub- groups. Cosine similarity was calculated between the ODE-generated trajectory (representative-cell NFκB dynamics) and experimental trajectories. The trajectories were then ranked and divided into different groups based on their cosine similarity with the ODE model. Experimental trajectories with higher simila...
-
[8]
For simplicity, we assume that trajectories within each group shared the same noise intensities
Each group of experimental trajectories was input into the trained neural network (see the next para- graph for more details) to infer the corresponding noise intensities(σ1,σ2). For simplicity, we assume that trajectories within each group shared the same noise intensities
-
[9]
The inferred noise is then used as inputs for the trained END-nSDE to simulate NFκB trajectories
-
[10]
The simulated trajectories were compared with the corresponding experimental data to evaluate the model’s performance. To estimate noise intensities from different groups of experimentallymeasuredsingle-cellnuclearNF κBtrajec- tories (step (3) in the proposed workflow), we trained an- other neural network to predict the corresponding IκBα transcription an...
-
[11]
The predicted noise intensities on the experiment data set are larger than the noise intensities on the train- ingset, andtheunderlyingreasoncouldbeextrinsicnoise which is not taken into account interferes with the infer- ence of noise intensity. The transcription noise of IκBα is predicted to be within the range of[10−0.81, 10−0.71] (see Fig. 8D). In add...
work page 2024
-
[12]
P. S. Swain, M. B. Elowitz, and E. D. Siggia, Proceedings of the National Academy of Sciences99, 12795 (2002)
work page 2002
-
[13]
M.B.Elowitz, A.J.Levine, E.D.Siggia,andP.S.Swain, Science 297, 1183 (2002)
work page 2002
-
[14]
A. Sanchez, S. Choubey, and J. Kondev, Annual Review of Biophysics42, 469 (2013)
work page 2013
- [15]
-
[16]
S. Mitchell and A. Hoffmann, Current Opinion in Sys- tems Biology8, 39 (2018)
work page 2018
-
[17]
M. Thattai and A. Van Oudenaarden, Proceedings of the National Academy of Sciences98, 8614 (2001)
work page 2001
-
[18]
L.S.Tsimring,ReportsonProgressinPhysics 77,026601 (2014)
work page 2014
-
[19]
A. Q. Fu and L. Pachter, Statistical Applications in Ge- netics and Molecular Biology15, 447 (2016)
work page 2016
-
[20]
A. Llamosi, A. M. Gonzalez-Vargas, C. Versari, E. Cin- quemani, G. Ferrari-Trecate, P. Hersen, and G. Batt, PLoS Computational Biology12, e1004706 (2016)
work page 2016
-
[21]
L. Dharmarajan, H.-M. Kaltenbach, F. Rudolf, and J. Stelling, Cell Systems8, 15 (2019)
work page 2019
-
[22]
B. Finkenstädt, D. J. Woodcock, M. Komorowski, C. V. Harper, J. R. E. Davis, M. R. H. White, and D. A. Rand, The Annals of Applied Statistics , 1960 (2013)
work page 1960
-
[23]
P. D. Dixit, Biophysical Journal104, 2743 (2013)
work page 2013
-
[24]
Z. Fang, A. Gupta, S. Kumar, and M. Khammash, Na- ture Communications15, 4911 (2024)
work page 2024
-
[25]
A. Raj, C. S. Peskin, D. Tranchina, D. Y. Vargas, and S. Tyagi, PLoS Biology4, e309 (2006)
work page 2006
-
[26]
J. M. Raser and E. K. O’Shea, Science304, 1811 (2004)
work page 2004
- [27]
-
[28]
A.Singh, B.S.Razooky, R.D.Dar,andL.S.Weinberger, Molecular Systems Biology8, 607 (2012)
work page 2012
-
[29]
Paulsson, Physics of Life Reviews2, 157 (2005)
J. Paulsson, Physics of Life Reviews2, 157 (2005)
work page 2005
- [30]
-
[31]
D.-G. Wang, S. Wang, B. Huang, and F. Liu, Scientific reports 9, 5883 (2019)
work page 2019
-
[32]
E. O. Voit, H. A. Martens, and S. W. Omholt, PLoS Computational Biology11, e1004012 (2015)
work page 2015
-
[33]
Ferner, British Journal of Clinical Pharmacology81 (2015)
R. Ferner, British Journal of Clinical Pharmacology81 (2015)
work page 2015
-
[34]
P. C. Bressloff and J. M. Newby, SIAM Journal on Ap- plied Dynamical Systems12, 1394 (2013)
work page 2013
-
[35]
T. G. Kurtz, Limit theorems and diffusion approxima- tions for density dependent markov chains, inStochas- tic Systems: Modeling, Identification and Optimization, I, edited by R. J.-B. Wets (Springer Berlin Heidelberg, Berlin, Heidelberg, 1976) pp. 67–78
work page 1976
-
[36]
T. Tian, K. Burrage, P. M. Burrage, and M. Car- letti, Journal of Computational and Applied Mathemat- ics 205, 696 (2007), special issue on evolutionary prob- lems
work page 2007
-
[37]
K.-C. Chen, T.-Y. Wang, H.-H. Tseng, C.-Y. F. Huang, and C.-Y. Kao, Bioinformatics21, 2883 (2005)
work page 2005
- [38]
-
[39]
C. Zechner, M. Unger, S. Pelet, M. Peter, and H. Koeppl, Nature Methods11, 197 (2014)
work page 2014
- [40]
- [41]
- [42]
- [43]
-
[44]
M. Xia, X. Li, Q. Shen, and T. Chou, Machine Learning: Science and Technology5, 045052 (2024)
work page 2024
- [45]
- [46]
- [47]
-
[48]
P. O. Westermark, D. K. Welsh, H. Okamura, and H. Herzel, PLoS Computational Biology5, 1 (2009)
work page 2009
- [49]
-
[50]
C. C. Caldwell and M. Spies, Critical Reviews in Bio- chemistry and Molecular Biology55, 482 (2020)
work page 2020
-
[51]
Zou, Molecular Cell65, 832 (2017)
H.D.Nguyen, T.Yadav, S.Giri, B.Saez, T.A.Graubert, and L. Zou, Molecular Cell65, 832 (2017). 15
work page 2017
-
[52]
M. S. Wold, Annual Review of Biochemistry 66, 61 (1997)
work page 1997
-
[53]
J. Ding, X. Li, J. Shen, Y. Zhao, S. Zhong, L. Lai, H. Niu, and Z. Qi, Nature Communications14, 3864 (2023)
work page 2023
-
[54]
D. T. Gillespie, The Journal of Physical Chemistry81, 2340 (1977)
work page 1977
-
[55]
D. T. Gillespie, The Journal of Chemical Physics113, 297 (2000)
work page 2000
-
[56]
K. M. Sheu, S. Luecke, and A. Hoffmann, Current Opin- ion in Systems Biology18, 53 (2019)
work page 2019
-
[58]
A. Hoffmann, A. Levchenko, M. L. Scott, and D. Balti- more, Science298, 1241 (2002)
work page 2002
-
[59]
Q. J. Cheng, S. Ohta, K. M. Sheu, R. Spreafico, A. Ade- laja, B. Taylor, and A. Hoffmann, Science 372, 1349 (2021)
work page 2021
-
[60]
S. Sen, Z. Cheng, K. M. Sheu, Y. H. Chen, and A. Hoff- mann, Cell systems10, 169 (2020)
work page 2020
-
[61]
A. Adelaja, B. Taylor, K. M. Sheu, Y. Liu, S. Luecke, and A. Hoffmann, Immunity54, 916 (2021)
work page 2021
-
[62]
X. Guo, A. Adelaja, S. Sen, A. Singh, and A. Hoffmann, Modeling NFkB dynamics of single immune cells re- veals the heterogeneity of single-cell stimulus-specificities (2024)
work page 2024
-
[63]
Bressloff,Stochastic processes in cell biology, Vol
P.C. Bressloff,Stochastic processes in cell biology, Vol. 41 (Springer, 2014)
work page 2014
-
[64]
B. Spagnolo, S. Spezia, L. Curcio, N. Pizzolato, A. Fias- conaro, D. Valenti, P. Lo Bue, E. Peri, and S. Colazza, The European Physical Journal B69, 133 (2009)
work page 2009
- [65]
- [66]
-
[67]
C. V. Rao, D. M. Wolf, and A. P. Arkin, Nature420, 231 (2002)
work page 2002
-
[68]
A. M. Arias and P. Hayward, Nature Reviews Genetics 7, 34 (2006)
work page 2006
- [69]
-
[70]
C. Fronk and L. Petzold, Chaos: An Interdisciplinary Journal of Nonlinear Science33, 043101 (2023)
work page 2023
-
[71]
L. Böttcher and T. Asikis, Machine Learning: Science and Technology3, 045004 (2022)
work page 2022
-
[72]
L. Böttcher and G. Wheeler, Journal of Statistical Me- chanics: Theory and Experiment2024, 023401 (2024)
work page 2024
-
[73]
L.L.Fonseca, L.Böttcher, B.Mehrad,andR.C.Lauben- bacher, PLoS Computational Biology 21, e1012138 (2025)
work page 2025
-
[74]
L. Böttcher, L. L. Fonseca, and R. C. Laubenbacher, Philosophical Transactions of the Royal Society A383, 20240228 (2025)
work page 2025
- [75]
-
[76]
L. Böttcher, N. Antulov-Fantulin, and T. Asikis, Nature Communications 13, 333 (2022)
work page 2022
-
[77]
D. Cao, M. Chan, and S. Ng, Computation11 (2023)
work page 2023
-
[78]
How to Construct Deep Recurrent Neural Networks
R. Pascanu, C. Gulcehre, K. Cho, and Y. Bengio, How to construct deep recurrent neural networks (2014), arXiv:1312.6026 [cs.NE]
work page internal anchor Pith review Pith/arXiv arXiv 2014
- [79]
-
[80]
H. Wang, C. Li, J. Zhang, J. Wang, Y. Ma, and Y. Lian, Journal of Bioinformatics and Computational Biology 17, 1950022 (2019)
work page 2019
- [81]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.