pith. sign in

arxiv: 2510.24539 · v2 · pith:5IXS25ACnew · submitted 2025-10-28 · 📊 stat.ME

Unbiased likelihood estimation of the Langevin diffusion for animal movement modelling

Pith reviewed 2026-05-21 20:43 UTC · model grok-4.3

classification 📊 stat.ME
keywords animal movementLangevin diffusionimportance samplingBrownian bridgetelemetryhabitat selectioncontinuous time modelsunbiased estimation
0
0 comments X

The pith

A Brownian bridge importance sampler produces unbiased estimates for parameters in the Langevin diffusion model of animal movement.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a method to estimate parameters in continuous-time movement models that handle irregular sampling times without introducing bias. Standard approaches to fitting Langevin diffusions to telemetry data suffer increasing bias as gaps between observations grow larger. The new approach uses importance sampling with Brownian bridges to approximate the likelihood more accurately. Simulations across different scenarios demonstrate that bias is effectively eliminated. When applied to real tracking data from sea lions, the model yields habitat preference estimates that differ from those in earlier work, potentially altering interpretations of ecological patterns.

Core claim

We propose an importance sampling scheme that employs Brownian bridges to approximate the likelihood function of the Langevin diffusion. This approximation becomes accurate as the number of sampled bridges increases, leading to unbiased maximum likelihood estimates for the model parameters. Through simulation studies, we verify that this method recovers the true parameters without bias for various sampling regimes and parameter values. In contrast to previous methods, our estimates maintain accuracy even for data with long intervals between observations. Applying the model to Steller sea lion tracking data shows convergence to habitat coefficients that are significantly different from those,

What carries the argument

Brownian bridge importance sampler that constructs proposal paths between observed locations to approximate the transition density of the Langevin diffusion

Load-bearing premise

That the importance sampler using Brownian bridges will converge to the exact likelihood of the Langevin diffusion when enough bridge samples are drawn.

What would settle it

Running the estimation on simulated data with known true parameters and observing whether the estimates approach the truth as the number of bridge samples goes to several thousand.

Figures

Figures reproduced from arXiv: 2510.24539 by Martin E. Pettersen, Robert B. O'Hara, Ron R. Togunov, S. Knutsen Furset.

Figure 1
Figure 1. Figure 1: Illustration of the proposed BBIS for one step. Panel a) shows tracks [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Two examples of simulated tracks from (1), superimposed on the (log) RSF. n = 2000 measurements are displayed with a step size ∆t = 0.5 between measurements. Covariates were generated using Perlin noise. Blue corresponds to large values of the (log) RSF, yellow corresponds to smaller values. 4.1 Simulation study I In the first numerical experiment we simulated 100 tracks with step-size 0.01 and thinned the… view at source ↗
Figure 3
Figure 3. Figure 3: Estimates of β⃗ and γ using ∆t as distance between observations and 5000 observations. The red dotted line shows the true value of the parameters We observe in [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Estimates of β⃗ and γ using ∆t as distance between observations and 500 as time span of tracks. The red dotted line shows the true value of the parameters From [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Estimates of β⃗ and γ using N bridge nodes, M = 50 bridges, ∆t = 1 and 5000 observations. The red dotted line shows the true value of the param￾eters [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Estimates of β⃗ and γ using N = 50 bridge nodes, M bridges, ∆t = 1 and 5000 observations. The red dotted line shows the true value of the param￾eters In [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗
read the original abstract

An ongoing challenge in animal ecology is developing movement models that account for the autocorrelation, and often temporal irregularity, in telemetry data. Continuous-time Langevin diffusion models have been proposed to model temporally autocorrelated and irregularly sampled data. However, current estimation techniques obtain increasingly biased parameter estimates as the time between observations increases. In this paper, we propose using Brownian bridges in an importance sampling scheme to improve the likelihood approximation of the Langevin diffusion model. In a series of simulation studies, we showed that our approach effectively removed the bias under various scenarios. We found that the precision of the estimated habitat coefficients increased for data spanning a longer duration at a lower frequency than for shorter, more frequently sampled tracks. This suggests that the model may be well suited for modelling tracking data sampled at a coarser resolution, as is common in datasets collected with older generations of animal tags. We illustrated the application of our model using tracking data from Steller sea lions, \textit{Eumetopias jubatus}. We found that the coefficient estimates converged to values significantly different than those estimated in previous studies, suggesting that bias in conventional estimation methods may meaningfully affect ecological conclusions about habitat preference. Together, these improvements broaden the applicability of Langevin diffusion models, thereby improving ecological insight into habitat selection.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes using Brownian bridges within an importance sampling scheme to approximate the intractable likelihood of the continuous-time Langevin diffusion for modeling autocorrelated and irregularly sampled animal telemetry data. Simulations are used to show that the approach removes bias in parameter estimates under various scenarios, with improved precision for longer-duration, lower-frequency tracks; the method is then applied to Steller sea lion tracking data, yielding habitat coefficient estimates that differ from those in prior studies.

Significance. If the importance sampler is shown to deliver unbiased (or at least consistent) estimates, the work would meaningfully extend the practical range of Langevin diffusion models to the coarser sampling frequencies typical of older telemetry tags, thereby strengthening ecological inferences about habitat selection.

major comments (2)
  1. [§3] §3 (importance sampling construction): the claim that the Brownian-bridge importance sampler converges to the true transition density of the Langevin SDE as the number of bridges increases is central to the 'unbiased' title and abstract, yet no analytic derivation, bias/variance analysis, or consistency proof under the model SDE is supplied; validation rests only on indirect simulation recovery.
  2. [§4] §4 (simulation studies): the reported parameter recovery without bias is load-bearing for the central claim, but the text does not specify the number of Monte Carlo replicates, the precise data-generating process (including observation-time irregularity), or any diagnostics for importance-weight degeneracy or effective sample size; without these, it is unclear whether the method remains reliable outside the simulated regimes.
minor comments (2)
  1. [Abstract] The abstract states that 'precision of the estimated habitat coefficients increased' for longer, lower-frequency tracks, but does not report the corresponding standard errors or confidence intervals that would allow readers to judge the magnitude of the improvement.
  2. [§3] Notation for the importance weights and the number of bridge samples is introduced without a clear summary table or algorithmic pseudocode, which would aid reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their detailed and constructive comments, which have helped us improve the clarity and rigor of the manuscript. We address each major comment point by point below.

read point-by-point responses
  1. Referee: §3 (importance sampling construction): the claim that the Brownian-bridge importance sampler converges to the true transition density of the Langevin SDE as the number of bridges increases is central to the 'unbiased' title and abstract, yet no analytic derivation, bias/variance analysis, or consistency proof under the model SDE is supplied; validation rests only on indirect simulation recovery.

    Authors: We appreciate the referee drawing attention to the theoretical basis of the convergence claim. The construction relies on the well-known property that a sequence of Brownian bridges with an increasing number of intermediate points converges in distribution to the true diffusion path under the Langevin SDE, allowing the importance-weighted average to recover the exact transition density in the limit. While the original manuscript emphasized simulation validation, we acknowledge that an explicit consistency argument was only sketched heuristically. In the revised manuscript we have expanded Section 3 with a short derivation outline based on the Girsanov change of measure and pathwise approximation results for diffusions, together with a brief bias analysis showing that the Monte Carlo error vanishes as the number of bridges tends to infinity. A full formal proof of consistency for this specific sampler is not included, as it would constitute a separate theoretical contribution beyond the applied focus of the paper. revision: partial

  2. Referee: §4 (simulation studies): the reported parameter recovery without bias is load-bearing for the central claim, but the text does not specify the number of Monte Carlo replicates, the precise data-generating process (including observation-time irregularity), or any diagnostics for importance-weight degeneracy or effective sample size; without these, it is unclear whether the method remains reliable outside the simulated regimes.

    Authors: We agree that these implementation details are essential for reproducibility and for evaluating the practical reliability of the sampler. In the revised manuscript we have added the exact number of Monte Carlo replicates (1,000 per scenario), a precise description of the data-generating process (including how irregular observation times were drawn from a uniform distribution over the study interval), and explicit diagnostics for effective sample size and weight degeneracy (reported in the main text and Supplementary Material). These additions confirm that degeneracy remained negligible across the simulated regimes and that the unbiased recovery holds under the tested conditions of autocorrelation and sampling irregularity. revision: yes

Circularity Check

0 steps flagged

No circularity detected in the Brownian bridge importance sampler derivation

full rationale

The paper constructs a new importance sampling estimator for the intractable transition density of the Langevin diffusion using Brownian bridges. This construction is defined independently of the target parameters and does not reduce to any fitted quantity or self-referential definition. The claim of unbiasedness follows from the standard limiting behavior of importance sampling as the number of bridge samples tends to infinity, with empirical validation supplied by simulation recovery rather than by renaming or tautological re-expression of inputs. No load-bearing self-citations, uniqueness theorems imported from the authors' prior work, or ansatz smuggling appear in the derivation chain; the method remains self-contained against external benchmarks of importance sampling theory.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. The method implicitly relies on standard properties of Brownian motion and importance sampling whose validity is taken from prior literature.

pith-pipeline@v0.9.0 · 5769 in / 1116 out tokens · 28746 ms · 2026-05-21T20:43:55.000974+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

27 extracted references · 27 canonical work pages

  1. [1]

    Comparative in- terpretation of count, presence–absence and point methods for species dis- tribution models

    Geert Aarts, John Fieberg, and Jason Matthiopoulos. “Comparative in- terpretation of count, presence–absence and point methods for species dis- tribution models”. en.Methods in Ecology and Evolution 3.1 (Feb. 2012), pp. 177–187.doi:10.1111/j.2041-210X.2011.00141.x

  2. [2]

    Closed-form likelihood expansions for multivariate diffusions

    Yacine Aït-Sahalia. “Closed-form likelihood expansions for multivariate diffusions”.The Annals of Statistics 36.2 (Apr. 2008).doi:10 . 1214 / 009053607000000622

  3. [3]

    Maximum Likelihood Estimation of Discretely Sam- pled Diffusions: A Closed-form Approximation Approach

    Yacine Aït-Sahalia. “Maximum Likelihood Estimation of Discretely Sam- pled Diffusions: A Closed-form Approximation Approach”.Econometrica 70.1 (Jan. 2002), pp. 223–262.doi:10.1111/1468-0262.00274

  4. [4]

    Spatial prediction of species distribution: an interface between ecological theory and statistical modelling

    M.P Austin. “Spatial prediction of species distribution: an interface between ecological theory and statistical modelling”. en. Ecological Modelling 157.2-3 (Nov. 2002), pp. 101–118.doi: 10.1016/S0304-3800(02)00205-3

  5. [5]

    Integrated step selection analysis: bridging the gap between resource selection and animal movement

    Tal Avgar et al. “Integrated step selection analysis: bridging the gap between resource selection and animal movement”. en. Methods in Ecology and Evolution 7.5 (May 2016). Ed. by Luca Börger, pp. 619–630.doi:10.1111/2041-210X.12528

  6. [6]

    Estimating Functions for Discretely Sampled Diffusion-Type Models

    Bo Martin Bibby, Martin Jacobsen, and Michael Sørensen. “Estimating Functions for Discretely Sampled Diffusion-Type Models”. Handbook of Financial Econometrics: Tools and Techniques. Elsevier, 2010, pp. 203–268.doi:10.1016/b978-0-444-50897-3.50007-9

  7. [7]

    Relating populations to habi- tats using resource selection functions

    Mark S. Boyce and Lyman L. McDonald. “Relating populations to habi- tats using resource selection functions”. Trends in ecology & evolution 14.7 (1999). Publisher: Elsevier, pp. 268–272

  8. [8]

    Numerical Techniques for Maximum Likelihood Estimation of Continuous-Time Diffusion Processes

    Garland B Durham and A. Ronald Gallant. “Numerical Techniques for Maximum Likelihood Estimation of Continuous-Time Diffusion Processes”. Journal of Business and Economic Statistics 20.3 (July 2002), pp. 297–338.doi:10.1198/073500102288618397

  9. [9]

    Rayleigh step-selection functions and connections to continuous-time mechanistic movement models

    Joseph M. Eisaguirre, Perry J. Williams, and Mevin B. Hooten. “Rayleigh step-selection functions and connections to continuous-time mechanistic movement models”.Movement Ecology 12.1 (Feb. 2024), p. 14.doi:10. 1186/s40462-023-00442-w

  10. [10]

    A ‘How to’ guide for interpreting parameters in habitat-selection analyses

    John Fieberg et al. “A ‘How to’ guide for interpreting parameters in habitat-selection analyses”. en. Journal of Animal Ecology 90.5 (May 2021), pp. 1027–1043.doi:10.1111/1365-2656.13441

  11. [11]

    Finite-sample equivalence in statis- tical models for presence-only data

    William Fithian and Trevor Hastie. “Finite-sample equivalence in statis- tical models for presence-only data”.The annals of applied statistics 7.4 (2012), p. 1917. 14

  12. [12]

    An Introduction to Statistical Models Used to Characterize Species-Habitat Associations with Animal Movement Data

    Katie R. N. Florko et al. “An Introduction to Statistical Models Used to Characterize Species-Habitat Associations with Animal Movement Data”. Movement Ecology 13.1 (Apr. 17, 2025), p. 27.doi:10.1186/s40462- 025-00549-2

  13. [13]

    Accounting for animal movement in estimation of resource selection functions: sampling and data analysis

    James D. Forester, Hae Kyung Im, and Paul J. Rathouz. “Accounting for animal movement in estimation of resource selection functions: sampling and data analysis”. en.Ecology 90.12 (Dec. 2009), pp. 3554–3565.doi: 10.1890/08-0874.1

  14. [14]

    Multilevel Monte Carlo Methods

    Stefan Heinrich. “Multilevel Monte Carlo Methods”. Large-Scale Scientific Computing. Springer Berlin Heidelberg, 2001, pp. 58–67.doi:10.1007/3-540-45346-6_5

  15. [15]

    Methods for im- plementing integrated step-selection functions with incomplete data

    David D. Hofmann, Gabriele Cozzi, and John Fieberg. “Methods for im- plementing integrated step-selection functions with incomplete data”. en. Movement Ecology 12.1 (May 2024), p. 37.doi:10.1186/s40462-024- 00476-8

  16. [17]

    Continuous-time Correlated Random Walk Model for Animal Telemetry Data

    Devin S. Johnson et al. “Continuous-time Correlated Random Walk Model for Animal Telemetry Data”.Ecology 89.5 (May 2008), pp. 1208–1215. doi:10.1890/07-1032.1

  17. [18]

    Movement responses to environment: fast inference of variation among southern elephant seals with a mixed effects model

    I. D. Jonsen et al. “Movement responses to environment: fast inference of variation among southern elephant seals with a mixed effects model”. en. Ecology 100.1 (2019), e02566.doi:10.1002/ecy.2566

  18. [19]

    Weighted distributions and es- timation of resource selection probability functions

    Subhash R. Lele and Jonah L. Keim. “Weighted distributions and es- timation of resource selection probability functions”. en.Ecology 87.12 (Dec. 2006), pp. 3021–3028.doi:10.1890/0012- 9658(2006)87[3021: WDAEOR]2.0.CO;2

  19. [20]

    A regularized bridge sampler for sparsely sampled dif- fusions

    Erik Lindström. “A regularized bridge sampler for sparsely sampled dif- fusions”.Statistics and Computing 22.2 (May 2011), pp. 615–623.doi: 10.1007/s11222-011-9255-y

  20. [21]

    Bryan F. J. Manly, Lyman L. McDonald, and Dana L. Thomas. Resource Selection by Animals. Springer Netherlands, 1993.doi: 10.1007/978-94-011-1558-2

  21. [22]

    The Langevin diffusion as a continuous- time model of animal movement and habitat selection

    Théo Michelot et al. “The Langevin diffusion as a continuous- time model of animal movement and habitat selection”. Methods in Ecology and Evolution 10.11 (Aug. 2019). Ed. by Torbjørn Ergon, pp. 1894–1907.doi:10.1111/2041-210x.13275

  22. [23]

    Analysis of Resource Selection Using Utiliza- tion Distributions

    Joshua J. Millspaugh et al. “Analysis of Resource Selection Using Utiliza- tion Distributions”. en.The Journal of Wildlife Management 70.2 (2006), pp. 384–395.doi:10.2193/0022-541X(2006)70[384:AORSUU]2.0.CO;2. 15

  23. [24]

    A spatially explicit habitat selection model in- corporating home range behavior

    Jonathan R. Rhodes et al. “A spatially explicit habitat selection model in- corporating home range behavior”. en.Ecology 86.5 (May 2005), pp. 1199– 1205.doi:10.1890/04-0912

  24. [25]

    Exponential Convergence of Langevin Distributions and Their Discrete Approximations

    Gareth O. Roberts and Richard L. Tweedie. “Exponential Convergence of Langevin Distributions and Their Discrete Approximations”.Bernoulli 2.4 (Dec. 1996), p. 341.doi:10.2307/3318418

  25. [26]

    Estimating utilization distributions from fitted step-selection functions

    Johannes Signer, John Fieberg, and Tal Avgar. “Estimating utilization distributions from fitted step-selection functions”. en.Ecosphere 8.4 (Apr. 2017), e01771.doi:10.1002/ecs2.1771

  26. [27]

    Improved bridge constructs for stochastic dif- ferential equations

    Gavin A. Whitaker et al. “Improved bridge constructs for stochastic dif- ferential equations”.Statistics and Computing 27.4 (May 2016), pp. 885– 900.doi:10.1007/s11222-016-9660-3

  27. [28]

    Kernel Methods for Estimating the Utilization Distribu- tion in Home-Range Studies

    B. J. Worton. “Kernel Methods for Estimating the Utilization Distribu- tion in Home-Range Studies”.Ecology 70.1 (1989). Publisher: Ecological Society of America, pp. 164–168.doi:10.2307/1938423. 16