pith. sign in

arxiv: 2603.25971 · v2 · pith:C7SZGYMVnew · submitted 2026-03-26 · 📊 stat.ME · math.ST· stat.TH

Design-Based Anytime-Valid Inference for Randomized Experiments with Delayed Outcomes and Staggered Entry

classification 📊 stat.ME math.STstat.TH
keywords design-basedconfidenceerrorestimationfiltrationmartingaletreatmentarm-specific
0
0 comments X
read the original abstract

Delayed outcomes are ubiquitous in online experimentation: treatment can affect whether an outcome occurs, when it occurs, and its realized value. To accommodate staggered entry while remaining robust to environmental nonstationarity and unit-level heterogeneity, we adopt a design-based perspective and target the sample cumulative reward in each arm as a function of calendar time. Our confidence sequences allow practitioners to continuously monitor the counterfactual incremental reward, such as revenue, that would have been realized by calendar time $t$ had all entered units been assigned to treatment rather than control. The main technical challenge is the choice of design-based filtration, complicated by the presence of asynchronous potential outcome times. We show that the IPW treatment-effect estimation error is not a martingale with respect to any filtration, while each arm-specific IPW estimation error is a martingale with respect to a carefully chosen arm-specific event-time filtration. We therefore construct a confidence sequence for the treatment effect by combining two arm-level confidence sequences with a union bound, and further demonstrate that this can outperform the traditional design-based variance upper bound. Finally, we characterize the class of augmentations for which the per-arm AIPW estimation error remains a martingale.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Adaptive Experimentation for Censored Survival Outcomes

    cs.LG 2026-05 unverdicted novelty 7.0

    Introduces the Adaptive Survival Estimator (ASE) that derives a closed-form efficiency-optimal allocation policy for estimating the average survival effect curve under right censoring.