Seeing Below the Limit of Detection: A Censored-Poisson Bayesian Latent-Growth Change-Point Detector (the Span Detector) for Serial ctDNA in HR+/HER2- Metastatic Breast Cancer

Aarchi Singh Thakur; Abhijoy Sarkar

arxiv: 2606.11876 · v1 · pith:LWMCX4W3new · submitted 2026-06-10 · 🧬 q-bio.QM · cs.LG· stat.ME

Seeing Below the Limit of Detection: A Censored-Poisson Bayesian Latent-Growth Change-Point Detector (the Span Detector) for Serial ctDNA in HR+/HER2- Metastatic Breast Cancer

Aarchi Singh Thakur , Abhijoy Sarkar This is my paper

Pith reviewed 2026-06-27 07:52 UTC · model grok-4.3

classification 🧬 q-bio.QM cs.LGstat.ME

keywords ctDNAchange-point detectioncensored Poissonmetastatic breast cancerliquid biopsyearly progression detectionBayesian modeling

0 comments

The pith

Span treats non-detects as censored observations to double early detection of indolent breast cancer progression from serial ctDNA.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Span, a Bayesian change-point detector that models ctDNA variant detection as a censored Poisson process. It treats sequences of faint detects and non-detects as carrying cumulative evidence of upward shifts in detection rate, rather than discarding non-detects. On synthetic trajectories for HR+/HER2- metastatic breast cancer patients, the method raises alarms three months before progression at a matched 10% false-alarm rate, catching 25% of indolent cases versus 11% for independent snapshot analysis. The gain is regime-specific and disappears for fast-emerging resistance. This approach requires no training weights and isolates its benefit to the handling of left-censored observations.

Core claim

Span is a censored-Poisson Bayesian latent-growth change-point detector that models the binary detection process, accumulates a sequential generalised-likelihood-ratio statistic for an upward change-point in the per-variant detection rate, and raises competing-risks alarms with calibrated false-alarm control. On a synthetic cohort of HR+/HER2- metastatic breast cancer on CDK4/6-inhibitor plus endocrine therapy, at a matched 10% false-alarm rate, Span roughly doubles the fraction of impending progressions caught three months ahead in the indolent regime (25% vs 11% for the snapshot), with a falsifiable dose-response that is large for indolent emergence and vanishes for fast emergence. A value

What carries the argument

The Span detector, a censored-Poisson Bayesian latent-growth change-point model that accumulates evidence from left-censored non-detects to identify upward shifts in per-variant detection rate.

If this is right

The performance advantage appears only in indolent regimes where detection is intermittent and disappears for rapid progression.
Modeling detection patterns rather than measured values drives the gain, since a value-trajectory baseline matches the snapshot method.
The pipeline matches a Cox baseline on real survival data and correctly withholds improvement on a clean biomarker cohort, confirming regime specificity.
Absence of learned weights eliminates overfitting as an explanation for the reported results.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the synthetic trajectories match real detection intermittency, the same censored modeling could be tested on other liquid-biopsy settings with sub-LoD signals.
The dose-response pattern suggests stratifying future real-data trials by estimated progression speed to isolate where the method adds value.
Integration into monitoring would require prospective calibration of the false-alarm threshold against clinical action thresholds.

Load-bearing premise

The synthetic ctDNA trajectories accurately reproduce the intermittent detection statistics of real nascent subclones below the assay limit of detection.

What would settle it

A head-to-head comparison of Span versus snapshot methods on a real longitudinal ctDNA dataset from metastatic breast cancer patients, testing whether the doubling of three-month-ahead warnings at 10% false-alarm rate holds specifically for indolent progression cases.

Figures

Figures reproduced from arXiv: 2606.11876 by Aarchi Singh Thakur, Abhijoy Sarkar.

**Figure 2.** Figure 2: Mechanism, one synthetic patient. The true latent subclone (blue) grows beneath the LoD; the assay returns non-detects (red ×) and, as detection probability rises, intermittent faint detects (navy). Span accumulates change-point evidence from this pattern and alarms (teal) while the clone is still sub-LoD—here ahead of the variant becoming callable and well ahead of imaging—whereas the commodity snapshot n… view at source ↗

**Figure 3.** Figure 3: Real-data sanity check (GBSG-2, n=686). Left: Kaplan–Meier recurrence-free survival for risk tertiles assigned by the deep model (clear monotone separation). Right: held-out concordance, deep competing-risks head vs Cox. The backbone matches the gold-standard baseline on real biology. 7 Real-data boundary test: where trajectory should not win (PBC2) The synthetic results make a sharp, directional predictio… view at source ↗

**Figure 4.** Figure 4: Boundary test, real PBC2 data. On a clean, slow biomarker the snapshot (grey) matches or beats the trajectory head (blue) on lead time, 3-year AUC, and sensitivity at a matched 10% false-alarm rate. This is the negative control: the advantage that appears in the censored ctDNA regime is correctly absent where the signal is not censored, confirming the dose-response rather than contradicting it. 8 Limitatio… view at source ↗

read the original abstract

Circulating-tumour DNA (ctDNA) carries evidence of drug resistance months before imaging shows it, but the earliest evidence lives below the assay's limit of detection (LoD): a nascent subclone is detected only intermittently, producing a flickering sequence of faint detects and non-detects. Commercial liquid biopsies treat each draw as an independent snapshot and a non-detect as nothing. We argue a non-detect is a left-censored observation, and the pattern of non-detects and faint detects over time carries actionable evidence of growth before any single value is trustworthy. We introduce Span, a censored-Poisson Bayesian latent-growth change-point detector that models the binary detection process, accumulates a sequential generalised-likelihood-ratio statistic for an upward change-point in the per-variant detection rate, and raises a competing-risks alarm with calibrated false-alarm control. Span has no learned weights, so there is nothing to overfit. On a synthetic cohort of HR+/HER2- metastatic breast cancer on first-line CDK4/6-inhibitor plus endocrine therapy, at a matched 10% false-alarm rate, Span roughly doubles the fraction of impending progressions caught three months ahead (indolent regime: 25% vs 11% for the snapshot), with a falsifiable dose-response: large for indolent emergence, vanishing for fast emergence. A value-trajectory baseline performs identically to the snapshot, isolating the gain to the censored detection model. The survival backbone matches a Cox baseline on real breast-cancer data (GBSG-2, n=686; C-index 0.67 vs 0.68), and on a real longitudinal cohort with clean biomarkers (PBC2, n=312) the same pipeline correctly declines to win, a falsifiable boundary test confirming the mechanism is regime-specific. All ctDNA trajectories are synthetic.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Span introduces a parameter-free censored-Poisson change-point detector for ctDNA non-detects that improves early detection on synthetic trajectories but has no real ctDNA validation for its core claim.

read the letter

Span models non-detects in ctDNA as left-censored Poisson observations inside a Bayesian latent-growth setup and runs a sequential GLR test on the detection rate to flag upward change-points. The method has no learned weights. On synthetic HR+/HER2- breast cancer trajectories it roughly doubles the share of indolent progressions caught three months ahead at a matched 10% false-alarm rate, while a value-trajectory baseline adds nothing.

The clean parts are the absence of fitted parameters and the boundary tests on real data. The GBSG-2 Cox match and the PBC2 non-improvement both behave as expected, which shows the authors checked that the pipeline does not claim gains where the mechanism should not apply.

The soft spot is that every ctDNA performance number comes from synthetic trajectories. The abstract states plainly that no real longitudinal ctDNA series with verified nascent subclones below the limit of detection was used to calibrate or test the generator's intermittency, censoring, or growth statistics. The reported dose-response (gain only for indolent emergence) therefore rests on how faithfully those trajectories reproduce actual below-LoD flickering. That fidelity is untested.

The paper is for statisticians and oncologists working on longitudinal biomarker monitoring in metastatic cancer. Readers who care about censored data models or change-point methods for liquid biopsies will see the most value.

It should go to peer review. The architecture is distinct, the claims are set up to be falsifiable, and the synthetic results are presented with the right controls.

Referee Report

2 major / 2 minor

Summary. The paper introduces Span, a parameter-free censored-Poisson Bayesian latent-growth change-point detector for serial ctDNA monitoring in HR+/HER2- metastatic breast cancer. It treats non-detects as left-censored observations, accumulates a sequential GLR statistic on the per-variant detection rate, and issues competing-risks alarms with calibrated false-alarm control. On synthetic ctDNA trajectories, at matched 10% FAR, Span doubles the fraction of impending progressions detected three months ahead in the indolent regime (25% vs 11% for snapshot baseline); the gain vanishes for fast emergence. Real-data experiments on GBSG-2 (Cox C-index match) and PBC2 (correct non-improvement) serve as boundary tests. All ctDNA trajectories are synthetic.

Significance. If the synthetic generator's fidelity to real below-LoD intermittency holds, Span offers a practical, non-overfit method to extract earlier resistance signals from existing liquid-biopsy assays. The parameter-free construction, explicit falsifiable dose-response, and use of real cohorts as negative controls are methodological strengths that distinguish it from learned models. The work could influence serial ctDNA interpretation protocols if the synthetic-to-real gap is closed.

major comments (2)

[Abstract] Abstract (final sentence) and § on synthetic cohort: the central performance claim (25% vs 11% detection at 3 months, indolent regime) rests exclusively on synthetic trajectories; the manuscript provides no real longitudinal ctDNA series containing verified nascent subclones below LoD to calibrate or validate the generator's detection-rate process, censoring statistics, or growth-change parameters. This assumption is load-bearing for the dose-response claim.
[Real-data experiments] Real-data experiments section: GBSG-2 (n=686) and PBC2 (n=312) test only the survival-backbone Cox component and correctly show no spurious gain; they do not validate the censored-Poisson change-point mechanism on ctDNA, leaving the ctDNA-specific performance unanchored to real intermittent detection behavior.

minor comments (2)

[Methods] Clarify in the methods how the synthetic generator's parameters (e.g., detection probability below LoD, growth rates) were chosen and whether sensitivity analyses were performed.
[Figures] Figure legends should explicitly state that all ctDNA panels are synthetic and note the exact FAR matching procedure used for the snapshot baseline.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the detailed and constructive report. We agree that the central performance claims rest on synthetic ctDNA trajectories and that the real-data experiments do not validate the change-point mechanism itself. We respond to each major comment below and indicate the revisions we will make.

read point-by-point responses

Referee: [Abstract] Abstract (final sentence) and § on synthetic cohort: the central performance claim (25% vs 11% detection at 3 months, indolent regime) rests exclusively on synthetic trajectories; the manuscript provides no real longitudinal ctDNA series containing verified nascent subclones below LoD to calibrate or validate the generator's detection-rate process, censoring statistics, or growth-change parameters. This assumption is load-bearing for the dose-response claim.

Authors: We acknowledge that the performance numbers are generated exclusively from synthetic trajectories, as the manuscript already states. No real longitudinal ctDNA series with verified nascent subclones below LoD is used for calibration or validation of the generator. We will revise the abstract's final sentence to explicitly qualify the 25% vs 11% claim as synthetic and add a dedicated paragraph in the discussion that details the generator's modeling assumptions, the falsifiable dose-response property, and the current inability to anchor it to real below-LoD intermittency. revision: yes
Referee: [Real-data experiments] Real-data experiments section: GBSG-2 (n=686) and PBC2 (n=312) test only the survival-backbone Cox component and correctly show no spurious gain; they do not validate the censored-Poisson change-point mechanism on ctDNA, leaving the ctDNA-specific performance unanchored to real intermittent detection behavior.

Authors: This is correct. The GBSG-2 and PBC2 experiments are designed solely as boundary tests for the survival component and to demonstrate regime-specificity (no spurious improvement where none is expected). They do not claim to validate the censored-Poisson change-point detector on ctDNA. We will revise the section heading and text to label these experiments explicitly as negative controls for the non-ctDNA pipeline elements and to restate that ctDNA-specific evaluation remains synthetic. revision: yes

standing simulated objections not resolved

No real longitudinal ctDNA series containing verified nascent subclones below the limit of detection is available to the authors, so direct empirical validation or calibration of the synthetic generator's detection-rate and censoring process cannot be performed in the current work.

Circularity Check

0 steps flagged

No significant circularity; main claim is empirical performance on independently generated synthetic trajectories

full rationale

The paper's derivation chain consists of a parameter-free censored-Poisson change-point detector applied to a synthetic ctDNA cohort, with the headline performance (25% vs 11% detection at 3 months) obtained directly from that application. Real-data experiments (GBSG-2 Cox match, PBC2 non-improvement) are presented only as boundary tests confirming regime specificity, not as inputs that define the ctDNA result. No self-citations, fitted parameters renamed as predictions, or self-definitional steps appear in the provided text; the model explicitly has no learned weights. The synthetic generator's fidelity to real below-LoD statistics is an external validity question, not a reduction of the reported result to its own inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Abstract-only review; full model specification, priors, and parameter definitions unavailable. The central claim rests on the domain assumption that detection events follow a censored Poisson process and on the ad-hoc assumption that the synthetic cohort reproduces real intermittent detection statistics.

axioms (2)

domain assumption Detection events follow a censored-Poisson process with latent growth rate that can exhibit an upward change-point
Core modeling choice stated in the abstract for the Span detector
ad hoc to paper Synthetic ctDNA trajectories accurately capture the statistical properties of real patient data below the limit of detection
All reported ctDNA performance results depend on this; abstract explicitly states all trajectories are synthetic

pith-pipeline@v0.9.1-grok · 5902 in / 1622 out tokens · 22765 ms · 2026-06-27T07:52:33.971207+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

9 extracted references

[1]

J. O. Brett et al. ESR1 mutation as an emerging clinical biomarker in metastatic HR+ breast cancer. Breast Cancer Research, 23:115, 2021

2021
[2]

Bidard et al

F.-C. Bidard et al. Switch to fulvestrant and palbociclib versus no switch in advanced breast cancer with rising ESR1 mutation (PADA-1).Lancet Oncology, 23(11):1367–1377, 2022

2022
[3]

Abbosh, A

C. Abbosh, A. M. Frankell et al. Tracking early lung-cancer evolution and ctDNA dynamics (TRACERx). Nature, 616:553–562, 2023

2023
[4]

C. Lee, W. R. Zame, J. Yoon, M. van der Schaar. DeepHit: a deep learning approach to survival analysis with competing risks.AAAI, 2018

2018
[5]

E. S. Page. Continuous inspection schemes.Biometrika, 41(1/2):100–115, 1954

1954
[6]

G. Lorden. Procedures for reacting to a change in distribution.Annals of Mathematical Statistics, 42(6):1897–1908, 1971

1908
[7]

J. Tobin. Estimation of relationships for limited dependent variables.Econometrica, 26(1):24–36, 1958

1958
[8]

Sarkar and A

A. Sarkar and A. Singh Thakur. OncoTraj v1: limits of single-snapshot tissue NGS for resistance prediction in EGFR-mutant NSCLC. Span AI technical report, 2026. 9

2026
[9]

Schumacher et al

M. Schumacher et al. Randomized2 ×2trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer (GBSG-2).Journal of Clinical Oncology, 12(10):2086–2093, 1994. 10

2086

[1] [1]

J. O. Brett et al. ESR1 mutation as an emerging clinical biomarker in metastatic HR+ breast cancer. Breast Cancer Research, 23:115, 2021

2021

[2] [2]

Bidard et al

F.-C. Bidard et al. Switch to fulvestrant and palbociclib versus no switch in advanced breast cancer with rising ESR1 mutation (PADA-1).Lancet Oncology, 23(11):1367–1377, 2022

2022

[3] [3]

Abbosh, A

C. Abbosh, A. M. Frankell et al. Tracking early lung-cancer evolution and ctDNA dynamics (TRACERx). Nature, 616:553–562, 2023

2023

[4] [4]

C. Lee, W. R. Zame, J. Yoon, M. van der Schaar. DeepHit: a deep learning approach to survival analysis with competing risks.AAAI, 2018

2018

[5] [5]

E. S. Page. Continuous inspection schemes.Biometrika, 41(1/2):100–115, 1954

1954

[6] [6]

G. Lorden. Procedures for reacting to a change in distribution.Annals of Mathematical Statistics, 42(6):1897–1908, 1971

1908

[7] [7]

J. Tobin. Estimation of relationships for limited dependent variables.Econometrica, 26(1):24–36, 1958

1958

[8] [8]

Sarkar and A

A. Sarkar and A. Singh Thakur. OncoTraj v1: limits of single-snapshot tissue NGS for resistance prediction in EGFR-mutant NSCLC. Span AI technical report, 2026. 9

2026

[9] [9]

Schumacher et al

M. Schumacher et al. Randomized2 ×2trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer (GBSG-2).Journal of Clinical Oncology, 12(10):2086–2093, 1994. 10

2086