Marginal Girsanov Reweighting: Stable Variance Reduction for Long-Timescale Dynamics from Biased Simulation

Hao Wu; Simon Olsson; Yan Wang

arxiv: 2509.25872 · v2 · submitted 2025-09-30 · 🧬 q-bio.QM · q-bio.BM

Marginal Girsanov Reweighting: Stable Variance Reduction for Long-Timescale Dynamics from Biased Simulation

Yan Wang , Hao Wu , Simon Olsson This is my paper

Pith reviewed 2026-05-18 12:02 UTC · model grok-4.3

classification 🧬 q-bio.QM q-bio.BM

keywords Marginal Girsanov Reweightingvariance reductionenhanced samplingmolecular dynamicskinetic propertiesreweighting methodsumbrella samplingmetadynamics

0 comments

The pith

Marginal Girsanov Reweighting keeps reweighting weights stable for long molecular dynamics by averaging over intermediate paths.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Classical Girsanov Reweighting provides exact ratios between biased and unbiased simulation paths but suffers from variance that grows quickly with simulation length, making it unusable for long timescales. The paper proposes Marginal Girsanov Reweighting to fix this by marginalizing over intermediate paths instead of using the full path probability. This produces weights that remain practical while still delivering unbiased estimates of kinetic and thermodynamic properties from simulations run with common biasing methods like umbrella sampling and metadynamics. A sympathetic reader would care because enhanced sampling is essential for studying rare events in molecules, but extracting correct long-time behavior from those runs has been a bottleneck.

Core claim

The central discovery is that by marginalizing over intermediate paths, one can derive reweighting factors whose variance does not explode with time. This Marginal Girsanov Reweighting yields stable, scalable weights that allow accurate recovery of unbiased kinetic observables from trajectories generated under umbrella sampling and metadynamics biases on various molecular systems.

What carries the argument

Marginal Girsanov Reweighting, a reweighting scheme that marginalizes over intermediate paths to compute stable probability ratios between biased and unbiased processes.

If this is right

Biased simulations can now be used to obtain reliable long-timescale kinetic information without prohibitive variance in the weights.
The method works for both umbrella sampling and metadynamics, suggesting broad applicability to enhanced sampling techniques.
Accurate recovery of kinetic properties becomes feasible for longer horizons than classical Girsanov Reweighting allows.
Scalable weights enable analysis of dynamics in larger or more complex molecular systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This approach might extend to other path-based sampling methods in statistical mechanics beyond molecular dynamics.
Researchers could test MGR on systems with known exact kinetics from direct long simulations to verify accuracy at even longer times.
Integration with existing enhanced sampling software could make unbiased analysis routine for rare-event studies.

Load-bearing premise

The marginalization over intermediate paths keeps the reweighting exactly unbiased while sufficiently reducing variance growth for the biases and systems considered.

What would settle it

Running MGR on a molecular system for which independent long unbiased simulations provide ground-truth kinetics, and finding that the recovered rates or distributions deviate beyond statistical error, would disprove the accuracy claim.

Figures

Figures reproduced from arXiv: 2509.25872 by Hao Wu, Simon Olsson, Yan Wang.

**Figure 2.** Figure 2: The training algorithm of MGR as illustrated [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 2.** Figure 2: In the next subsection, we introduce the ratio estimation method used in MGR [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: Results for the one-dimensional four-well potential system. [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: Results for the 22 atoms Alanine Dipeptide system. [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: Results for Bayesian inference in the Graph Ornstein–Uhlenbeck process. [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 6.** Figure 6: Results for Bayesian inference in the Stochastic Lotka-Volterra system. [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗

read the original abstract

Recovering unbiased kinetic and thermodynamic observables from the enhanced sampling simulations is a central challenge in rare-event sampling. Classical Girsanov Reweighting (GR) offers a principled solution by yielding exact pathwise probability ratios between biased and unbiased processes. However, the variance of GR weights grows rapidly with time, rendering it impractical for long-horizon reweighting. We introduce Marginal Girsanov Reweighting (MGR), which mitigates variance explosion by marginalizing over intermediate paths, producing stable and scalable weights for long-timescale dynamics. Experiments on various molecular dynamics systems demonstrate that MGR accurately recovers unbiased kinetic properties from trajectories generated under both umbrella sampling and metadynamics biases.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Marginal Girsanov Reweighting (MGR) as a variance-reduction technique for reweighting long-timescale trajectories generated under biased sampling (umbrella sampling and history-dependent metadynamics). It extends classical Girsanov reweighting by marginalizing over intermediate paths to produce stable weights while claiming to recover unbiased kinetic and thermodynamic observables from molecular dynamics simulations.

Significance. If the central unbiasedness claim holds for kinetic observables, MGR would address a practical bottleneck in enhanced sampling by enabling reliable reweighting over timescales where standard pathwise Girsanov weights become unusable due to variance growth. The work builds directly on established Radon-Nikodym derivatives between biased and unbiased path measures and supplies a concrete algorithmic modification.

major comments (2)

[§3] §3 (Theoretical construction of MGR weights): the argument that marginalization over intermediate paths yields an unbiased estimator of the path probability ratio for kinetic observables (e.g., time-correlation functions or transition rates) is not fully secured. Classical GR supplies exact pathwise ratios; the manuscript must explicitly derive or verify that the expectation of the marginal weight recovers the correct marginal ratio while preserving time ordering, rather than assuming it follows automatically from the equilibrium case.
[Experiments] Experiments section (results on molecular systems): the abstract and reported demonstrations assert accurate recovery of unbiased kinetic properties, yet no quantitative metrics (error bars, system sizes, comparison to standard GR or other reweighting baselines, or convergence diagnostics) are supplied in the provided summary. This prevents verification that the variance reduction is achieved without introducing bias for the tested biases and observables.

minor comments (2)

[§2] Notation for the marginal weight function should be introduced with an explicit equation immediately after the definition of the classical Girsanov weight to avoid ambiguity when comparing the two.
[Figures] Figure captions for the reweighting results should include the exact observable being recovered (e.g., specific time-correlation function) and the simulation length in units of the unbiased timescale.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading and valuable suggestions. We address the major comments point by point below, indicating where revisions will be made to strengthen the manuscript.

read point-by-point responses

Referee: [§3] §3 (Theoretical construction of MGR weights): the argument that marginalization over intermediate paths yields an unbiased estimator of the path probability ratio for kinetic observables (e.g., time-correlation functions or transition rates) is not fully secured. Classical GR supplies exact pathwise ratios; the manuscript must explicitly derive or verify that the expectation of the marginal weight recovers the correct marginal ratio while preserving time ordering, rather than assuming it follows automatically from the equilibrium case.

Authors: We agree that an explicit derivation is necessary to fully secure the unbiasedness claim for kinetic observables. In the revised manuscript, we will add a dedicated subsection in §3 that derives the marginal weight as an unbiased estimator of the path probability ratio. Specifically, we will show that by taking the expectation over intermediate paths, the marginal weight equals the Radon-Nikodym derivative between the marginal measures, while preserving the time-ordering of the dynamics. This will be verified analytically for time-correlation functions and transition rates, building directly on the classical Girsanov theorem without assuming equilibrium properties. revision: yes
Referee: [Experiments] Experiments section (results on molecular systems): the abstract and reported demonstrations assert accurate recovery of unbiased kinetic properties, yet no quantitative metrics (error bars, system sizes, comparison to standard GR or other reweighting baselines, or convergence diagnostics) are supplied in the provided summary. This prevents verification that the variance reduction is achieved without introducing bias for the tested biases and observables.

Authors: We note that the full manuscript does include some quantitative results on molecular systems, but we acknowledge that the presentation in the experiments section can be improved for clarity. In the revision, we will add explicit error bars from multiple independent runs, specify the system sizes used, include direct comparisons to standard GR where feasible (noting its variance issues for long times), and report convergence diagnostics such as weight histograms and effective sample sizes. This will provide stronger evidence that MGR achieves variance reduction without bias. revision: partial

Circularity Check

0 steps flagged

No significant circularity in the MGR derivation chain

full rationale

The paper introduces Marginal Girsanov Reweighting as an extension of classical Girsanov Reweighting by marginalizing over intermediate paths to stabilize weights for long-timescale dynamics. The abstract and described construction rely on the standard property that the expectation of the marginal Radon-Nikodym derivative recovers the correct path measure ratio, presented as a direct mathematical step rather than a fit or self-referential definition. No equations or claims reduce by construction to fitted parameters, renamed known results, or load-bearing self-citations; the central unbiasedness for kinetic observables follows from the marginalization identity under the stated path measures. Experiments on molecular systems under umbrella sampling and metadynamics supply independent validation, keeping the derivation self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

Only the abstract is available, so the full ledger cannot be audited. The work likely rests on standard results from stochastic processes and introduces MGR as the primary new construct without visible free parameters or invented physical entities in the summary.

axioms (1)

standard math Girsanov theorem providing exact pathwise probability ratios under change of measure for diffusion processes
Classical GR is built on this theorem; MGR extends it by marginalization.

invented entities (1)

Marginal Girsanov Reweighting (MGR) no independent evidence
purpose: To produce stable weights for long-timescale reweighting by marginalizing intermediate paths
New method introduced to solve the variance issue described in the abstract.

pith-pipeline@v0.9.0 · 5649 in / 1335 out tokens · 41941 ms · 2026-05-18T12:02:09.459276+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

MGR defines the marginal weight as expectation of pathwise Girsanov reweighting (GR) factors... wkτ(xt,xt+kτ)=𝔼[wGRkτ∣Xt=xt,Xt+kτ=xt+kτ]
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

recovering unbiased kinetic and thermodynamic observables from the enhanced sampling simulations

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

13 extracted references · 13 canonical work pages · 1 internal anchor

[1]

Boltzmann priors for implicit transfer operators.arXiv preprint arXiv:2410.10605,

Juan Viguera Diez, Mathias Schreiner, Ola Engkvist, and Simon Olsson. Boltzmann priors for implicit transfer operators.arXiv preprint arXiv:2410.10605,

work page arXiv
[2]

Density estimation using Real NVP

URLhttps://openreview.net/forum?id=pRCOZllZdT. Laurent Dinh, Jascha Sohl-Dickstein, and Samy Bengio. Density estimation using real nvp.arXiv preprint arXiv:1605.08803,

work page internal anchor Pith review Pith/arXiv arXiv
[3]

doi: 10.1021/acs.jpclett.2c03327

ISSN 1948-7185. doi: 10.1021/acs.jpclett.2c03327. URL http://dx.doi.org/10.1021/acs.jpclett.2c03327. Christopher S Jones. Bayesian estimation of continuous-time finance models.manuscript University of Rochester,

work page doi:10.1021/acs.jpclett.2c03327 1948
[4]

Density ratio estimation with conditional probability paths.arXiv preprint arXiv:2502.02300,

Hanlin Yu, Arto Klami, Aapo Hyv¨arinen, Anna Korba, and Omar Chehab. Density ratio estimation with conditional probability paths.arXiv preprint arXiv:2502.02300,

work page arXiv
[5]

13 A VARIANCE OFGIRSANOVREWEIGHTING We analyze the variance behavior of Girsanov reweighting over a continuous time interval[t, t+τ]. According to Girsanov theory Girsanov (1960), the log-weight under the Girsanov transformation can be expressed as logw GR τ (xt,τ) = Z t+τ t u(x s, s)⊤ dWs − 1 2 Z t+τ t ∥u(x s, s)∥2 ds, whereu(x t, t) := f(x t,t)− ˜f(x t,...

work page 1960
[6]

and˜µ(xt,τ)(the perturbed process with drift term ˜f(·, t)). For any bounded measurable test functionO:R d ×R d →R, Eρτ(xt,xt+τ) [O(xt, xt+τ)] = Z O(xt, xt+τ)ρτ(xt, xt+τ)dxtdxt+τ = Z O(xt, xt+τ)δ(Xt −x t)δ(Xt+τ −x t+τ)dµ(xt,τ) = Z O(xt, xt+τ)δ(Xt −x t)δ(Xt+τ −x t+τ)dµ d˜µ(xt,τ)d˜µ(xt,τ) = Z O(xt, xt+τ)˜ρτ(xt, xt+τ)dµ d˜µ(xt,τ)d˜µ(xt,τ) :=E ˜ρτ(xt,xt+τ) [w...

work page 1963
[7]

amber14-all.xml

at300K. The system employed the Amber14 force field with OBC2 implicit water (“amber14-all.xml”, “implicit/obc2.xml”). Dynamics were propagated with an Underdamped Langevin integrator with time step2fs. The aggregated simulation time was 1µs. Coordinates and Girsanov reweighting (GR) factors were saved every 20 steps (40fs) using the Girsanov-enabled OPEN...

work page 2024
[8]

p(y|θ 0)p(θ)dydθ ≈ P θ θ w θ kτ (y)p(θ)P θ wθ kτ (y)p(θ) .(S.3) For the baseline particle marginal Metropolis–Hastings (PMMH) algorithm (Golightly & Wilkinson, 2008; Hoffman et al.,

work page 2008
[9]

We use the default setting in their original paper

and a variational inference (VI) approach (Ghosh et al., 2022). We use the default setting in their original paper. 20 E.2.1 GRAPHORNSTEIN-UHLENBECK We consider a two dimensional Graph Ornstein-Uhlenbeck (OU) process as introduced in Sec- tion 5.2.1. The Euler–Maruyama scheme with time step∆t= 10 −3 is used to generate a trajectory and the simulation is i...

work page 2022
[10]

Kernel moment matching, e.g

is a fundamental technique for comparing two distribu- tions. Kernel moment matching, e.g. KMM (Gretton et al., 2009), matches all the moments with reproducing kernels, which is effective and computationally efficient. Probabilistic classification recasts ratio estimation as posteriors from a binary classifier (Menon & Ong, 2016), showing pow- erful fitti...

work page 2009
[11]

Path-based methods (Choi et al., 2022; Yu et al.,

further performs classification in a learned latent space, mitigating issues caused by large distributional dis- crepancies. Path-based methods (Choi et al., 2022; Yu et al.,

work page 2022
[12]

By constructing consecutive path distributions, it alleviates the problems caused by poor overlap between two densities

connect the two distributions via a continuous probability path and estimate the density ratio by integrating a learned time score. By constructing consecutive path distributions, it alleviates the problems caused by poor overlap between two densities. However, unlike the standard setting with samples from both distributions, here we only have sam- ples f...

work page 2025
[13]

Featurized classifier (weighted BCE)We first map each paired sample to a latent representation zϕ = Φ(xt, xt+kτ ;ϕ), and then perform the classifier-based ratio estimation in this feature space. A joint training objective is adopted (Choi et al., 2021): Ljoint =αL BCE(θ, ϕ) + (1−α)L latent(ϕ), whereL BCE(θ, ϕ) =−E t [ct logh θ(zϕ) + log (1−h θ(zϕ))]is the...

work page 2021

[1] [1]

Boltzmann priors for implicit transfer operators.arXiv preprint arXiv:2410.10605,

Juan Viguera Diez, Mathias Schreiner, Ola Engkvist, and Simon Olsson. Boltzmann priors for implicit transfer operators.arXiv preprint arXiv:2410.10605,

work page arXiv

[2] [2]

Density estimation using Real NVP

URLhttps://openreview.net/forum?id=pRCOZllZdT. Laurent Dinh, Jascha Sohl-Dickstein, and Samy Bengio. Density estimation using real nvp.arXiv preprint arXiv:1605.08803,

work page internal anchor Pith review Pith/arXiv arXiv

[3] [3]

doi: 10.1021/acs.jpclett.2c03327

ISSN 1948-7185. doi: 10.1021/acs.jpclett.2c03327. URL http://dx.doi.org/10.1021/acs.jpclett.2c03327. Christopher S Jones. Bayesian estimation of continuous-time finance models.manuscript University of Rochester,

work page doi:10.1021/acs.jpclett.2c03327 1948

[4] [4]

Density ratio estimation with conditional probability paths.arXiv preprint arXiv:2502.02300,

Hanlin Yu, Arto Klami, Aapo Hyv¨arinen, Anna Korba, and Omar Chehab. Density ratio estimation with conditional probability paths.arXiv preprint arXiv:2502.02300,

work page arXiv

[5] [5]

13 A VARIANCE OFGIRSANOVREWEIGHTING We analyze the variance behavior of Girsanov reweighting over a continuous time interval[t, t+τ]. According to Girsanov theory Girsanov (1960), the log-weight under the Girsanov transformation can be expressed as logw GR τ (xt,τ) = Z t+τ t u(x s, s)⊤ dWs − 1 2 Z t+τ t ∥u(x s, s)∥2 ds, whereu(x t, t) := f(x t,t)− ˜f(x t,...

work page 1960

[6] [6]

and˜µ(xt,τ)(the perturbed process with drift term ˜f(·, t)). For any bounded measurable test functionO:R d ×R d →R, Eρτ(xt,xt+τ) [O(xt, xt+τ)] = Z O(xt, xt+τ)ρτ(xt, xt+τ)dxtdxt+τ = Z O(xt, xt+τ)δ(Xt −x t)δ(Xt+τ −x t+τ)dµ(xt,τ) = Z O(xt, xt+τ)δ(Xt −x t)δ(Xt+τ −x t+τ)dµ d˜µ(xt,τ)d˜µ(xt,τ) = Z O(xt, xt+τ)˜ρτ(xt, xt+τ)dµ d˜µ(xt,τ)d˜µ(xt,τ) :=E ˜ρτ(xt,xt+τ) [w...

work page 1963

[7] [7]

amber14-all.xml

at300K. The system employed the Amber14 force field with OBC2 implicit water (“amber14-all.xml”, “implicit/obc2.xml”). Dynamics were propagated with an Underdamped Langevin integrator with time step2fs. The aggregated simulation time was 1µs. Coordinates and Girsanov reweighting (GR) factors were saved every 20 steps (40fs) using the Girsanov-enabled OPEN...

work page 2024

[8] [8]

p(y|θ 0)p(θ)dydθ ≈ P θ θ w θ kτ (y)p(θ)P θ wθ kτ (y)p(θ) .(S.3) For the baseline particle marginal Metropolis–Hastings (PMMH) algorithm (Golightly & Wilkinson, 2008; Hoffman et al.,

work page 2008

[9] [9]

We use the default setting in their original paper

and a variational inference (VI) approach (Ghosh et al., 2022). We use the default setting in their original paper. 20 E.2.1 GRAPHORNSTEIN-UHLENBECK We consider a two dimensional Graph Ornstein-Uhlenbeck (OU) process as introduced in Sec- tion 5.2.1. The Euler–Maruyama scheme with time step∆t= 10 −3 is used to generate a trajectory and the simulation is i...

work page 2022

[10] [10]

Kernel moment matching, e.g

is a fundamental technique for comparing two distribu- tions. Kernel moment matching, e.g. KMM (Gretton et al., 2009), matches all the moments with reproducing kernels, which is effective and computationally efficient. Probabilistic classification recasts ratio estimation as posteriors from a binary classifier (Menon & Ong, 2016), showing pow- erful fitti...

work page 2009

[11] [11]

Path-based methods (Choi et al., 2022; Yu et al.,

further performs classification in a learned latent space, mitigating issues caused by large distributional dis- crepancies. Path-based methods (Choi et al., 2022; Yu et al.,

work page 2022

[12] [12]

By constructing consecutive path distributions, it alleviates the problems caused by poor overlap between two densities

connect the two distributions via a continuous probability path and estimate the density ratio by integrating a learned time score. By constructing consecutive path distributions, it alleviates the problems caused by poor overlap between two densities. However, unlike the standard setting with samples from both distributions, here we only have sam- ples f...

work page 2025

[13] [13]

Featurized classifier (weighted BCE)We first map each paired sample to a latent representation zϕ = Φ(xt, xt+kτ ;ϕ), and then perform the classifier-based ratio estimation in this feature space. A joint training objective is adopted (Choi et al., 2021): Ljoint =αL BCE(θ, ϕ) + (1−α)L latent(ϕ), whereL BCE(θ, ϕ) =−E t [ct logh θ(zϕ) + log (1−h θ(zϕ))]is the...

work page 2021