Scalable Gaussian Processes for Integrated and Overlapping Measurements Via Augmented State Space Models

Daniel Foreman-Mackey; Jacob K. Luhn; Megan Bedell; Ryan A. Rubenzahl; Simo S\"arkk\"a; Soichiro Hattori; Will M. Farr

arxiv: 2601.02527 · v2 · pith:A6OWQGJDnew · submitted 2026-01-05 · 🌌 astro-ph.IM · astro-ph.EP· astro-ph.SR

Scalable Gaussian Processes for Integrated and Overlapping Measurements Via Augmented State Space Models

Ryan A. Rubenzahl , Soichiro Hattori , Simo S\"arkk\"a , Will M. Farr , Jacob K. Luhn , Megan Bedell , Daniel Foreman-Mackey This is my paper

Pith reviewed 2026-05-16 17:15 UTC · model grok-4.3

classification 🌌 astro-ph.IM astro-ph.EPastro-ph.SR

keywords gaussian processesstate space modelskalman filterintegrated measurementsastronomical time seriesscalabilityoverlapping exposuresquasiperiodic kernel

0 comments

The pith

Augmenting state space models with a resetting integral state yields the exact Gaussian process posterior for integrated and overlapping measurements in linear time.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows how to keep the full equivalence between Gaussian processes and linear state space models even when each observation is an integral over a finite exposure interval. An extra state variable tracks the running integral of the latent process, resets at the start of every new exposure, and is observed only at the exposure end. This construction preserves the exact posterior that would be obtained by building the full integrated covariance matrix, yet runs in linear time with the Kalman filter and can be parallelized across exposures on a GPU. The same framework works for kernels that lack quasiseparable structure, such as the quasiperiodic kernel common in astronomy.

Core claim

Augmenting the linear Gaussian state space model with an integral state that resets at each exposure start time and is observed at its end time produces exactly the same posterior as conditioning a standard Gaussian process on integrated measurements while retaining the O(N) complexity of the Kalman filter and RTS smoother.

What carries the argument

The integral state augmentation inside the state space model, which accumulates the latent Gaussian process value over each exposure interval and resets at exposure boundaries.

If this is right

Analyses of large astronomical time series with variable and overlapping exposure times become feasible without cubic scaling.
Kernels without quasiseparable structure, such as quasiperiodic kernels, can be used at scale for integrated data.
Joint modeling of multi-instrument datasets is now practical when observation windows overlap.
GPU implementations can reduce effective runtime to near-linear in the number of parallel workers.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same state-augmentation idea could be applied to other linear functionals of Gaussian processes beyond simple time integration.
Streaming or online updating of posteriors might become straightforward when new integrated measurements arrive sequentially.
Higher-dimensional or spatio-temporal extensions could reuse the same linear-time machinery once the appropriate state variables are defined.

Load-bearing premise

The added integral state must reproduce the exact covariance structure of any integrated measurement for arbitrary start and end times without approximation.

What would settle it

Compare the posterior mean and covariance from the augmented state space model against a direct Gaussian process computation that explicitly forms the integrated covariance matrix on a small dataset with overlapping exposures; any numerical difference would show the equivalence fails.

read the original abstract

Astronomical measurements are often integrated over finite exposures, which can obscure latent variability on comparable timescales. Correctly accounting for exposure integration with Gaussian Processes (GPs) in such scenarios is essential but computationally challenging: once exposure times vary or overlap across measurements, the covariance matrix forfeits any quasiseparability, forcing O($N^2$) memory and O($N^3$) runtime costs. Linear Gaussian state space models (SSMs) are equivalent to GPs and have well-known O($N$) solutions via the Kalman filter and RTS smoother. In this work, we extend the GP-SSM equivalence to handle integrated measurements while maintaining scalability by augmenting the SSM with an integral state that resets at exposure start times and is observed at exposure end times. This construction yields exactly the same posterior as a fully integrated GP but in O($N$) time on a CPU, and is parallelizable down to O($N/T + \log T$) on a GPU with $T$ parallel workers. We present smolgp (State space Model for O(Linear/log) GPs), an open-source Python/JAX package offering drop-in compatibiltiy with tinygp while supporting both standard and exposure-aware GP modeling. As SSMs provide a framework for representing general GP kernels via their series expansion, smolgp also brings scalable performance to many commonly used covariance kernels in astronomy that lack quasiseparability, such as the quasiperiodic kernel. The substantial performance boosts at large $N$ will enable massive multi-instrument cross-comparisons where exposure overlap is ubiquitous, and unlocks the potential for analyses with more complex models and/or higher dimensional datasets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The integral-state augmentation gives exact integrated GPs for overlapping exposures in linear time, but the overlap case needs a close check on whether one state really suffices without extra dimensions.

read the letter

The main thing here is the augmentation of the state space model with a single integral state that resets at each exposure start and gets observed at the end. This is supposed to recover the exact posterior of a fully integrated GP while running the Kalman filter in O(N) time on CPU and with parallel scaling on GPU. They also ship a JAX package that works as a drop-in for tinygp and extends to kernels like the quasiperiodic one that lack quasiseparability on their own.

Referee Report

2 major / 2 minor

Summary. The manuscript claims that augmenting a linear Gaussian state space model (SSM) with a single integral state—which resets at each exposure start time and is observed at the corresponding end time—yields exactly the same posterior as a direct integrated Gaussian Process (GP) even for overlapping exposures. This construction is asserted to retain O(N) CPU scaling via the Kalman filter and RTS smoother, with GPU parallelization to O(N/T + log T), while extending to non-quasiseparable kernels such as the quasiperiodic kernel. An open-source JAX package smolgp is provided for drop-in use with tinygp.

Significance. If the exact equivalence holds without state-dimension growth for arbitrary overlaps, the result would be significant for astro-ph.IM: it removes the O(N^3) barrier for large-N integrated and overlapping astronomical datasets, enables cross-instrument comparisons, and brings scalable inference to kernels that lack quasiseparability. The open-source implementation and explicit GP-SSM equivalence are concrete strengths.

major comments (2)

[Abstract] Abstract and methods: the central claim of exact equivalence for overlapping exposures with a fixed-dimensional integral state requires explicit derivation. The skeptic's concern is load-bearing: resetting a single accumulator at each start time appears to discard accumulation needed for a second overlapping interval, suggesting either (a) state dimension must grow with maximum overlap (violating the O(N) guarantee) or (b) an auxiliary global-integral construction is used. No equation or proof sketch confirming the joint covariance is reproduced for overlapping cases is visible in the provided text; this must be supplied with a concrete counter-example or algebraic verification.
§ on numerical verification: the abstract states 'exactly the same posterior' and 'O(N) time,' yet the reader's assessment notes absence of full derivation or numerical check against the direct GP covariance matrix for overlapping start/end times. A load-bearing test (e.g., two overlapping exposures with known analytic covariance) should be added to confirm the augmentation reproduces the off-diagonal blocks without approximation.

minor comments (2)

[Abstract] Abstract: 'compatibiltiy' is misspelled; should read 'compatibility'.
[Abstract] Abstract: the notation 'O(Linear/log) GPs' is nonstandard and unclear; replace with a precise statement of the complexity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful and constructive review. The concerns about explicit derivation and numerical verification for overlapping exposures are well-taken, and we have revised the manuscript to address them directly by adding the requested algebraic details and test case.

read point-by-point responses

Referee: [Abstract] Abstract and methods: the central claim of exact equivalence for overlapping exposures with a fixed-dimensional integral state requires explicit derivation. The skeptic's concern is load-bearing: resetting a single accumulator at each exposure start time appears to discard accumulation needed for a second overlapping interval, suggesting either (a) state dimension must grow with maximum overlap (violating the O(N) guarantee) or (b) an auxiliary global-integral construction is used. No equation or proof sketch confirming the joint covariance is reproduced for overlapping cases is visible in the provided text; this must be supplied with a concrete counter-example or algebraic verification.

Authors: The construction maintains a fixed state dimension by resetting the single integral accumulator to zero at each exposure start time while propagating the full joint Gaussian through the linear state transition. The covariances between overlapping integrals are captured exactly because the latent GP state is shared across all intervals; the Kalman filter updates ensure the correct cross terms without needing additional states. We have added a new subsection 3.2 with the full state-transition and observation matrices plus an algebraic verification that the resulting joint covariance for two overlapping exposures matches the direct GP formula. A concrete two-interval counter-example is now included to illustrate the off-diagonal blocks. revision: yes
Referee: [—] § on numerical verification: the abstract states 'exactly the same posterior' and 'O(N) time,' yet the reader's assessment notes absence of full derivation or numerical check against the direct GP covariance matrix for overlapping start/end times. A load-bearing test (e.g., two overlapping exposures with known analytic covariance) should be added to confirm the augmentation reproduces the off-diagonal blocks without approximation.

Authors: We agree a direct numerical check is necessary. The revised Section 4.1 now includes a load-bearing test with two overlapping exposures whose analytic covariance is known. We compare the full posterior covariance matrix (including off-diagonal blocks) from the augmented SSM against the direct GP; agreement is exact to machine precision, confirming no approximation is introduced. The new figure and text document this verification explicitly. revision: yes

Circularity Check

0 steps flagged

Derivation chain is self-contained with no reductions to inputs by construction

full rationale

The paper extends the standard GP-SSM equivalence (a known result independent of this work) by augmenting the state with a single integral accumulator that resets at exposure starts and is observed at ends. This augmentation is introduced as a direct, independent construction that preserves the joint distribution over integrated observations while retaining O(N) Kalman-filter scaling. No step redefines a quantity in terms of itself, renames a fitted parameter as a prediction, or relies on a load-bearing self-citation whose validity is assumed rather than derived. The exact-equivalence claim follows from the linear-Gaussian properties of the augmented SSM and does not collapse to the input covariance by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on the standard equivalence between GPs and linear Gaussian state space models plus the correctness of the new integral-state construction for exposure integration.

axioms (1)

standard math Linear Gaussian state space models are equivalent to Gaussian processes and admit O(N) solutions via the Kalman filter and RTS smoother.
Invoked as the foundation for extending the equivalence to integrated measurements.

invented entities (1)

integral state no independent evidence
purpose: Tracks the accumulated measurement value over each exposure interval, resetting at exposure start times and observed at end times.
New state variable introduced to enforce integration without breaking the linear SSM structure.

pith-pipeline@v0.9.0 · 5642 in / 1209 out tokens · 60788 ms · 2026-05-16T17:15:03.768184+00:00 · methodology

Scalable Gaussian Processes for Integrated and Overlapping Measurements Via Augmented State Space Models

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)