pith. sign in

arxiv: 2006.12806 · v1 · submitted 2020-06-23 · 📊 stat.ME · stat.CO

Seeded intervals and noise level estimation in change point detection: A discussion of Fryzlewicz (2020)

Pith reviewed 2026-05-24 14:20 UTC · model grok-4.3

classification 📊 stat.ME stat.CO
keywords change point detectionnoise level estimationseeded intervalsmodel selectiontime series segmentationsignal-to-noise ratiopiecewise constant signals
0
0 comments X

The pith

A novel noise level estimator improves model selection for change point detection in frequent low-signal cases.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This discussion compares seeded intervals to random intervals for change point segmentation, weighing their merits on practical use, statistical performance, and computational cost. The authors also examine a new estimator for the underlying noise level. This estimator strengthens several standard model selection rules, including the steepest-drop criterion, with the largest gains appearing when changes occur often but the signal is weak relative to noise. A sympathetic reader would care because many real time series, from financial prices to genomic data, require reliable detection of shifts amid substantial noise.

Core claim

The paper compares the choice of seeded intervals and random intervals for change point segmentation from practical, statistical and computational perspectives. It further investigates a novel estimator of the noise level, which improves many existing model selection procedures including the steepest drop to low levels, particularly for challenging frequent change point scenarios with low signal-to-noise ratios.

What carries the argument

The novel noise level estimator used to refine model selection after interval-based change point search.

If this is right

  • Seeded intervals can be preferable to random intervals on grounds of computation and stability in segmentation tasks.
  • The noise estimator upgrades performance of multiple model-selection rules beyond the steepest-drop method.
  • Gains are most pronounced precisely when change points are numerous and the signal-to-noise ratio is low.
  • The estimator can be plugged into existing pipelines without altering their core search procedures.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The estimator may extend naturally to other variance-estimation problems in piecewise-constant signal models.
  • Its practical value would increase if shown to be robust under mild departures from the Gaussian noise assumption common in change-point theory.
  • Comparative studies could test whether seeded intervals plus the new estimator together outperform purely random-interval approaches on real data sets.

Load-bearing premise

The proposed noise level estimator delivers genuine improvements in model selection without introducing biases or depending on unstated data properties.

What would settle it

A Monte Carlo study in a frequent-change low-SNR regime where the new estimator yields no reduction in selection error or produces systematically biased change-point counts compared with existing methods.

Figures

Figures reproduced from arXiv: 2006.12806 by Housen Li, Peter B\"uhlmann, Solt Kov\'acs.

Figure 1
Figure 1. Figure 1: Boxplots of the JFNL and MAD estimators of the noise level σ in the extreme.teeth example of Fryzlewicz (2020) on the very left panel and five other examples from Fryzlewicz (2014) based on 1000 simulations each. The true values of σ are indicated by the red horizontal lines respectively. also improves SDLL methodology in noisy scenarios. SeedBS with a fixed threshold finds fewer change points, but with a … view at source ↗
Figure 2
Figure 2. Figure 2: Boxplots of the number of found change points (based on 100 simulations each) in the extreme.teeth example of Fryzlewicz (2020) with noise level σ = 0.3 (top) and σ = 0.45 (bottom) for various estimation approaches. Model selection approaches based on the JFNL estimator are in red, MAD based counterparts in black, while green refers to approaches from Du et al. (2016). The true number of change points is i… view at source ↗
read the original abstract

In this discussion, we compare the choice of seeded intervals and that of random intervals for change point segmentation from practical, statistical and computational perspectives. Furthermore, we investigate a novel estimator of the noise level, which improves many existing model selection procedures (including the steepest drop to low levels), particularly for challenging frequent change point scenarios with low signal-to-noise ratios.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript compares seeded intervals versus random intervals for change point segmentation from practical, statistical, and computational perspectives. It further proposes and investigates a novel estimator of the noise level, claimed to improve many existing model selection procedures (including the steepest drop to low levels), especially in frequent change-point scenarios with low signal-to-noise ratios.

Significance. A well-validated noise-level estimator that demonstrably improves model selection in low-SNR, high-frequency regimes would be a useful addition to the change-point literature; the seeded-versus-random interval comparison may also supply practical guidance if supported by concrete metrics.

major comments (2)
  1. [Abstract] Abstract: the central claim that the novel noise-level estimator improves existing model-selection procedures is asserted without any explicit construction, formula, bias/variance analysis, or simulation results. The manuscript must supply the estimator definition and controlled comparisons in the stated regime before the improvement claim can be assessed.
  2. [Abstract] Abstract: no conditions, assumptions, or counter-examples are provided under which the claimed gains hold or fail; without these, it is impossible to determine whether reported improvements are genuine or artifacts of unstated data properties.
minor comments (1)
  1. The title references a discussion of Fryzlewicz (2020), yet the abstract does not explicitly delineate which results or procedures from that work are being revisited.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our discussion paper. We address each major comment below and agree that the abstract requires strengthening to better support the claims about the noise-level estimator.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that the novel noise-level estimator improves existing model-selection procedures is asserted without any explicit construction, formula, bias/variance analysis, or simulation results. The manuscript must supply the estimator definition and controlled comparisons in the stated regime before the improvement claim can be assessed.

    Authors: The full manuscript supplies the explicit construction and formula for the novel noise-level estimator (Section 3), together with bias/variance analysis and controlled simulation comparisons against existing procedures in the low-SNR, frequent-change regime (Section 5). The abstract is deliberately concise and therefore omits these details. We will revise the abstract to include a brief description of the estimator and a pointer to the relevant sections, thereby making the improvement claim more transparent at the abstract level. revision: yes

  2. Referee: [Abstract] Abstract: no conditions, assumptions, or counter-examples are provided under which the claimed gains hold or fail; without these, it is impossible to determine whether reported improvements are genuine or artifacts of unstated data properties.

    Authors: We agree that an explicit delineation of the operating regime would strengthen the paper. The current text emphasizes the challenging frequent-change, low-SNR setting where gains are observed, but does not state the precise assumptions or provide counter-examples. We will add a short subsection (or paragraph in the discussion) that lists the key assumptions (minimum jump size, maximum number of changes, noise model) and includes a brief simulation or analytic counter-example illustrating when the estimator does not improve upon standard methods. revision: yes

Circularity Check

0 steps flagged

No circularity: discussion paper compares methods and proposes noise estimator without self-referential reductions.

full rationale

The paper is a discussion comparing seeded vs. random intervals and investigating a novel noise level estimator for change point detection. The abstract and context present this as a practical/statistical comparison and an empirical investigation of improvements in model selection, particularly under frequent changes and low SNR. No equations, derivations, or load-bearing steps are shown that reduce claims to self-definitions, fitted inputs renamed as predictions, or self-citation chains. The central claim of improvement is stated qualitatively without any exhibited construction that would make it tautological by the paper's own inputs. This is a normal non-finding for a discussion piece whose value rests on external validation rather than internal derivation.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Only abstract available, so ledger is minimal; no explicit free parameters, axioms, or invented entities are stated.

axioms (1)
  • domain assumption Standard statistical assumptions for change point models in time series (e.g., piecewise constant mean or variance with additive noise)
    The comparison and estimator implicitly rely on typical change point model assumptions common in the field.

pith-pipeline@v0.9.0 · 5588 in / 1083 out tokens · 21752 ms · 2026-05-24T14:20:54.323142+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

13 extracted references · 13 canonical work pages

  1. [1]

    Baranowski, R., Chen, Y., and Fryzlewicz, P. (2019). Narrowest-over-threshold detection of multiple change points and change-point-like features. Journal of the Royal Statistical Society, Series B , 81(3):649--672

  2. [2]

    M., and Kou, S

    Du, C., Kao, C.-L. M., and Kou, S. C. (2016). Stepwise signal extraction via marginal likelihood. Journal of the American Statistical Association , 111(513):314--330

  3. [3]

    and Rigaill, G

    Fearnhead, P. and Rigaill, G. (2020). Relating and comparing methods for detecting changes in mean. To appear in Stat, e291

  4. [4]

    Fryzlewicz, P. (2014). Wild binary segmentation for multiple change-point detection. The Annals of Statistics , 42(6):2243--2281

  5. [5]

    Fryzlewicz , P. (2020). Detecting possibly frequent change-points: Wild Binary Segmentation 2 and steepest-drop model selection . Journal of the Korean Statistical Society

  6. [6]

    W., and Titterington, D

    Hall, P., Kay, J. W., and Titterington, D. M. (1990). Asymptotically optimal difference-based estimation of variance in nonparametric regression. Biometrika , 77(3):521--528

  7. [7]

    Killick, R., Fearnhead, P., and Eckley, I. A. (2012). Optimal detection of changepoints with a linear computational cost. Journal of the American Statistical Association , 107(500):1590--1598

  8. [8]

    Kov \'a cs , S., Li , H., B \"u hlmann , P., and Munk , A. (2020). Seeded binary segmentation: a general methodology for fast and optimal change point detection . arXiv:2002.06633

  9. [9]

    Kov\'acs, S., Li, H., Haubner, L., B\"uhlmann, P., and Munk, A. (2020). Optimistic search strategies: change point detection without full grid search . Working paper

  10. [10]

    Li, H., Munk, A., and Sieling, H. (2016). F DR -control in multiscale change-point segmentation. Electronic Journal of Statistics , 10(1):918--959

  11. [11]

    Londschien , M., Kov \'a cs , S., and B \"u hlmann , P. (2019). Change point detection for graphical models in the presence of missing values . To appear in the Journal of Computational and Graphical Statistics, arXiv:1907.05409

  12. [12]

    Vostrikova, L. Y. (1981). Detecting `disorder' in multidimensional random processes. Soviet Mathematics Doklady , 24:55--59

  13. [13]

    Yao, Y.-C. (1988). Estimating the number of change-points via Schwarz' criterion. Statistics & Probability Letters , 6(3):181--189