Conformal Prediction with Time-Series Data via Sequential Conformalized Density Regions

K.S. Chan; M. Sampson

arxiv: 2604.07325 · v1 · submitted 2026-04-08 · 📊 stat.ME · math.ST· stat.ML· stat.TH

Conformal Prediction with Time-Series Data via Sequential Conformalized Density Regions

M. Sampson , K.S. Chan This is my paper

Pith reviewed 2026-05-10 17:53 UTC · model grok-4.3

classification 📊 stat.ME math.STstat.MLstat.TH

keywords conformal predictiontime series analysisconditional coverageprediction intervalsdensity estimationquantile random forestasymptotic theorybifurcation

0 comments

The pith

The SCDR method achieves asymptotic conditional coverage for time-series predictions by adjusting density regions with a quantile random forest step.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Sequential Conformalized Density Regions (SCDR) as a conformal prediction technique tailored for time-series data. It begins with estimated conditional highest-density regions and applies a quantile random forest adjustment to ensure coverage while respecting the sequential dependence. The method is shown to be doubly robust, delivering the desired coverage if the density model is right or if the conformal scores obey a correctly ordered nonlinear autoregressive process. Readers should care because reliable prediction sets for dependent data enable better uncertainty quantification in forecasting applications, and the method can output disconnected sets when the underlying process has multiple modes.

Core claim

We propose Sequential Conformalized Density Regions (SCDR) which forms initial predictive regions from estimated conditional highest density regions and then uses quantile random forest conformal adjustment to provide guaranteed asymptotic conditional coverage for time-series data. The method is doubly robust: it attains the coverage rate asymptotically if the predictive density is correctly specified or if the scores follow a nonlinear autoregressive model of the correct order. It can produce both connected intervals and disconnected sets that signal bifurcations.

What carries the argument

The quantile random forest conformal adjustment applied sequentially to highest-density regions, which adapts to non-exchangeability in the time series.

If this is right

The method attains guaranteed asymptotic conditional coverage under regularity conditions.
It produces smaller and more informative prediction sets than existing approaches in simulations.
It can form disconnected prediction sets to represent bifurcations in the data.
The double robustness allows validity even under partial model misspecification.
Empirical performance improves on real datasets like geyser eruptions and electricity usage.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The double robustness property suggests the method could be robust in many practical settings where one component is easier to model correctly than the other.
Detecting disconnected sets might serve as an early indicator of regime shifts or bifurcations in the underlying process.
Extending the sequential adjustment to other forms of dependence, such as spatial or network data, appears feasible given the adaptation to non-exchangeability.
If the autoregressive model for scores is used, it opens a path to integrate time-series modeling directly into conformal calibration.

Load-bearing premise

The data satisfy the regularity conditions needed for the asymptotic result, and either the conditional density estimator is consistent or the score process follows the assumed nonlinear autoregressive model.

What would settle it

A simulation or real dataset where the coverage rate fails to approach the target level as the sample size increases, even when one of the two robustness conditions holds.

Figures

Figures reproduced from arXiv: 2604.07325 by K.S. Chan, M. Sampson.

**Figure 1.** Figure 1: ACF Plots for the 2,000 Observed Data Points, Bootstrap Approach Simulation. [PITH_FULL_IMAGE:figures/full_fig_p069_1.png] view at source ↗

**Figure 2.** Figure 2: Partial ACF Plots for the 2,000 Observed Data Points, Bootstrap Approach [PITH_FULL_IMAGE:figures/full_fig_p070_2.png] view at source ↗

**Figure 3.** Figure 3: ACF Plots for the 500 Observed Data Points, Leave One Out Approach Simula [PITH_FULL_IMAGE:figures/full_fig_p071_3.png] view at source ↗

**Figure 4.** Figure 4: Partial ACF Plots for the 500 Observed Data Points, Leave One Out Approach [PITH_FULL_IMAGE:figures/full_fig_p071_4.png] view at source ↗

**Figure 5.** Figure 5: ACF Plots for the AR(2) Simulation. From top left to right: SPCI, PID, BCI, [PITH_FULL_IMAGE:figures/full_fig_p072_5.png] view at source ↗

**Figure 6.** Figure 6: Partial ACF Plots for the AR(2) Simulation. From top left to right: SPCI, PID, [PITH_FULL_IMAGE:figures/full_fig_p072_6.png] view at source ↗

**Figure 7.** Figure 7: ACF Plots for the Old Faithful Eruption Scores. From top left to right: SPCI, [PITH_FULL_IMAGE:figures/full_fig_p073_7.png] view at source ↗

**Figure 8.** Figure 8: PACF Plots for the Old Faithful Eruption Scores. From top left to right: SPCI, [PITH_FULL_IMAGE:figures/full_fig_p073_8.png] view at source ↗

**Figure 9.** Figure 9: ACF Plots for the Electricity Scores. From top left to right: SPCI, PID, BCI, [PITH_FULL_IMAGE:figures/full_fig_p074_9.png] view at source ↗

**Figure 10.** Figure 10: PACF Plots for the Electricity Scores. From top left to right: SPCI, PID, BCIm [PITH_FULL_IMAGE:figures/full_fig_p074_10.png] view at source ↗

read the original abstract

We propose a new conformal prediction method for time-series data with a guaranteed asymptotic conditional coverage rate, Sequential Conformalized Density Regions (SCDR), which is flexible enough to produce both prediction intervals and disconnected prediction sets, signifying the emergence of bifurcations. Our approach uses existing estimated conditional highest density predictive regions to form initial predictive regions. We then use a quantile random forest conformal adjustment to provide guaranteed coverage while adaptively changing to take the non-exchangeable nature of time-series data into account. We show that the proposed method achieves the guaranteed coverage rate asymptotically under certain regularity conditions. In particular, the method is doubly robust -- it works if the predictive density model is correctly specified and/or if the scores follow a nonlinear autoregressive model with the correct order specified. Simulations reveal that the proposed method outperforms existing methods in terms of empirical coverage rates and set sizes. We illustrate the method using two real datasets, the Old Faithful geyser dataset and the Australian electricity usage dataset. Prediction sets formed using SCDR for the geyser eruption durations include both single intervals and unions of two intervals, whereas existing methods produce wider, less informative, single-interval prediction sets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

SCDR gives a flexible way to get asymptotic conditional coverage for time series by starting from density regions and adjusting with a quantile forest, but the double-robustness claim rests on either a correct density or an exact NAR order for the scores.

read the letter

The paper introduces Sequential Conformalized Density Regions, which takes an initial highest-density region from a fitted conditional density and then uses a quantile random forest to adjust the conformity scores for dependence. This produces asymptotic conditional coverage and can output disconnected sets when the data shows bifurcations, as in the Old Faithful example where it splits into two intervals instead of one wide one. The simulations are said to beat existing methods on coverage and set size, and the electricity data example is used to show practical behavior. That flexibility for non-connected sets is a clear plus over standard interval methods for sequential data.

Referee Report

3 major / 2 minor

Summary. The paper proposes Sequential Conformalized Density Regions (SCDR), a conformal prediction procedure for time-series data. It begins with an estimated conditional highest-density region from a predictive density model, then applies a quantile random forest adjustment on conformity scores to achieve asymptotic conditional coverage while accommodating non-exchangeability. The central claim is that the method attains the target coverage rate asymptotically under regularity conditions and is doubly robust: coverage holds if the initial density model is correctly specified or if the scores follow a nonlinear autoregressive process of correctly specified order. The approach is illustrated on simulations and two real datasets (Old Faithful geyser durations and Australian electricity usage), where it produces possibly disconnected prediction sets and is reported to outperform existing methods in coverage and set size.

Significance. If the asymptotic conditional coverage result and double-robustness property can be established with explicit regularity conditions and verifiable assumptions, the work would contribute a flexible conformal method for dependent data that permits multimodal or disconnected sets. The combination of density-based initialization with quantile-forest adjustment is a technically interesting direction for handling time-series dependence without requiring full exchangeability.

major comments (3)

[Abstract and §3] Abstract and §3 (method description): the double-robustness claim requires that, when the conditional density is misspecified, the quantile random forest exactly recovers the conditional quantiles of a nonlinear autoregressive score process of known order. No data-driven procedure for selecting or validating this order is described, and the paper does not quantify coverage degradation when the assumed order is off by one lag. This assumption is load-bearing for the robustness statement and must be accompanied by either a selection algorithm or a sensitivity analysis.
[Abstract] Abstract: the asymptotic conditional coverage guarantee is stated to hold under 'certain regularity conditions' (mixing rates, smoothness of the score conditional distribution, etc.), yet neither the explicit list of conditions nor a proof sketch or reference to the derivation appears in the provided abstract or high-level description. Finite-sample bounds are also absent. These elements are required to substantiate the central theoretical claim.
[Simulations and real-data examples] Simulations and real-data sections: the reported outperformance in empirical coverage and set size must be accompanied by explicit checks that the double-robustness cases (correct density vs. correct NAR order) are separately validated; otherwise the simulation results cannot isolate which component drives the improvement.

minor comments (2)

[§2] Notation for conformity scores and the highest-density region should be introduced with a single consistent symbol set early in the paper to avoid ambiguity when the quantile forest is applied.
[§3] The description of how the quantile random forest is trained on the time-series scores (e.g., lag embedding, cross-validation for forest hyperparameters) needs a dedicated paragraph or algorithm box for reproducibility.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their insightful comments, which have helped us identify areas for improvement in the manuscript. We provide point-by-point responses to the major comments below, indicating the changes we will implement in the revised version.

read point-by-point responses

Referee: [Abstract and §3] Abstract and §3 (method description): the double-robustness claim requires that, when the conditional density is misspecified, the quantile random forest exactly recovers the conditional quantiles of a nonlinear autoregressive score process of known order. No data-driven procedure for selecting or validating this order is described, and the paper does not quantify coverage degradation when the assumed order is off by one lag. This assumption is load-bearing for the robustness statement and must be accompanied by either a selection algorithm or a sensitivity analysis.

Authors: We agree with the referee that the order of the nonlinear autoregressive process is a critical assumption for the double-robustness property when the density model is misspecified. In the revised manuscript, we will describe a data-driven procedure for selecting the order using cross-validation based on the conformity scores from a hold-out set. Furthermore, we will include a sensitivity analysis in the simulation studies to quantify the degradation in coverage when the assumed order is incorrect by one lag, demonstrating the robustness of the method to small misspecifications. revision: yes
Referee: [Abstract] Abstract: the asymptotic conditional coverage guarantee is stated to hold under 'certain regularity conditions' (mixing rates, smoothness of the score conditional distribution, etc.), yet neither the explicit list of conditions nor a proof sketch or reference to the derivation appears in the provided abstract or high-level description. Finite-sample bounds are also absent. These elements are required to substantiate the central theoretical claim.

Authors: We appreciate this observation. The explicit regularity conditions and the full proof are provided in Section 4 of the manuscript. To address the concern, we will add a high-level proof sketch and a summarized list of the key regularity conditions (including mixing rates and smoothness assumptions) to the abstract and the introduction of the revised version. Finite-sample bounds are not derived in the current work as our primary contribution is the asymptotic guarantee; we will note this as an avenue for future research. revision: partial
Referee: [Simulations and real-data examples] Simulations and real-data sections: the reported outperformance in empirical coverage and set size must be accompanied by explicit checks that the double-robustness cases (correct density vs. correct NAR order) are separately validated; otherwise the simulation results cannot isolate which component drives the improvement.

Authors: We concur that separating the contributions of the two robustness components would strengthen the empirical validation. In the revised simulations section, we will add dedicated experiments that isolate the cases: (i) correctly specified density model with misspecified NAR order, (ii) misspecified density model with correctly specified NAR order, and (iii) both correctly specified. Similar checks will be discussed for the real-data examples where possible, to better attribute the observed improvements. revision: yes

Circularity Check

0 steps flagged

No circularity: asymptotic coverage derived from stated regularity conditions and external estimators, not reduced to inputs by construction

full rationale

The paper's central claim is an asymptotic conditional coverage guarantee for SCDR under explicit regularity conditions (mixing, smoothness, etc.), with double robustness stated as holding when either the initial conditional density is correctly specified or the conformity scores obey a correctly ordered NAR process. No equation or derivation step in the abstract or described method reduces the coverage result to a fitted quantity or self-citation by construction; the quantile random forest adjustment is presented as an external conformal step whose validity rests on the listed assumptions rather than being tautological with the inputs. The method builds on existing density estimators without redefining them via the target coverage. This is the common case of a self-contained proposal whose guarantees are conditional on verifiable modeling assumptions rather than circular.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on unspecified regularity conditions for asymptotic coverage and on the assumption that either the base density estimator or the score process is correctly specified; no free parameters, invented entities, or additional axioms are stated in the abstract.

axioms (2)

domain assumption Regularity conditions sufficient for asymptotic conditional coverage of the conformalized density regions
Invoked to obtain the guaranteed coverage rate; location not specified beyond the abstract statement.
domain assumption The nonconformity scores follow a nonlinear autoregressive model of correct order (alternative to correct density specification)
Part of the double-robustness claim.

pith-pipeline@v0.9.0 · 5509 in / 1421 out tokens · 36196 ms · 2026-05-10T17:53:42.103355+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

6 extracted references · 6 canonical work pages

[2]

The conditional point estimator ˆg(x) is uniformly consistent for the true regression functiong(x): sup x∈X |g(x)−ˆg(x)|=o p(1). 16

work page
[3]

∼Pand are independent ofX t

The true data generating process is a location family: Yt =g(X t) +ϵ t, whereϵ t i.i.d. ∼Pand are independent ofX t. Proof.Define the oracle scores as ˜Vt =Y t −g(X t) =g(X t) +ϵ t −g(X t) =ϵ t, which are i.i.d. by Assumption 3. Next, define the scores used in practice as Vt =Y t −ˆg(Xt). Finally, looking at the difference between the scores we have: sup ...

work page
[4]

The covariate space,X ⊆R p, is compact

work page
[5]

The conditional quantile estimators ˆqτ(x) are uniformly consistent for the true con- ditional quantilesq τ(x), forτ∈ {α low, αhigh}: sup x∈X |qτ(x)−ˆqτ(x)|=o p(1),forτ∈ {α low, αhigh}. 17

work page
[6]

∼Pand are independent ofX t

The true data generating process is a location family: Yt =g(X t) +ϵ t, whereϵ t i.i.d. ∼Pand are independent ofX t. Proof.By Assumption 3, the population quantiles ofY t areq τ(X t) =g(X t) +z τ, where zτ is theτ-th quantile of the error distribution. Define the oracle scores as: ˜Vt = max{qαlow(X t)−Y t, Yt −q αhigh(X t)} = max{g(X t) +z αlow −g(X t)−ϵ ...

work page 2001
[7]

The desired bound then follows by choosingK=ϵ −2/γ. 33 B.4 Joint Mixture Normals implies Conditional are Mixture Nor- mal For simplicity, we denote the density of a multivariate Normal distribution evaluated aty with meanµand varianceΣasN(y|µ,Σ). Recall that when the joint density is Normal i.e., not a mixture, both the marginal and conditional densities ...

work page 1998

[1] [2]

The conditional point estimator ˆg(x) is uniformly consistent for the true regression functiong(x): sup x∈X |g(x)−ˆg(x)|=o p(1). 16

work page

[2] [3]

∼Pand are independent ofX t

The true data generating process is a location family: Yt =g(X t) +ϵ t, whereϵ t i.i.d. ∼Pand are independent ofX t. Proof.Define the oracle scores as ˜Vt =Y t −g(X t) =g(X t) +ϵ t −g(X t) =ϵ t, which are i.i.d. by Assumption 3. Next, define the scores used in practice as Vt =Y t −ˆg(Xt). Finally, looking at the difference between the scores we have: sup ...

work page

[3] [4]

The covariate space,X ⊆R p, is compact

work page

[4] [5]

The conditional quantile estimators ˆqτ(x) are uniformly consistent for the true con- ditional quantilesq τ(x), forτ∈ {α low, αhigh}: sup x∈X |qτ(x)−ˆqτ(x)|=o p(1),forτ∈ {α low, αhigh}. 17

work page

[5] [6]

∼Pand are independent ofX t

The true data generating process is a location family: Yt =g(X t) +ϵ t, whereϵ t i.i.d. ∼Pand are independent ofX t. Proof.By Assumption 3, the population quantiles ofY t areq τ(X t) =g(X t) +z τ, where zτ is theτ-th quantile of the error distribution. Define the oracle scores as: ˜Vt = max{qαlow(X t)−Y t, Yt −q αhigh(X t)} = max{g(X t) +z αlow −g(X t)−ϵ ...

work page 2001

[6] [7]

The desired bound then follows by choosingK=ϵ −2/γ. 33 B.4 Joint Mixture Normals implies Conditional are Mixture Nor- mal For simplicity, we denote the density of a multivariate Normal distribution evaluated aty with meanµand varianceΣasN(y|µ,Σ). Recall that when the joint density is Normal i.e., not a mixture, both the marginal and conditional densities ...

work page 1998