Sequentially-Rerandomized Switchback Experiments
Pith reviewed 2026-05-13 20:43 UTC · model grok-4.3
The pith
Sequentially rerandomizing switchback experiments balances lagged outcomes and covariates to reduce variance.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
SRSB re-randomizes treatment assignment at every time period so that pre-specified prognostic variables constructed from past observations remain balanced; this leverages temporal dependence to improve precision in the absence of carryover while preserving a known randomization distribution that supports both exact finite-sample inference under the sharp null and asymptotic inference as the number of periods increases.
What carries the argument
Sequentially-Rerandomized Switchback (SRSB) design, which at each period draws a new treatment vector that satisfies balance constraints on lagged outcomes and covariates.
Load-bearing premise
Pre-specified prognostic variables built from past observations can be balanced at each period without breaking the randomization distribution or creating selection bias.
What would settle it
Run parallel standard switchback and SRSB experiments on the same platform data with no carryover; if the empirical variance of the SRSB estimator is not smaller, the precision claim is false.
Figures
read the original abstract
Large-scale online platforms and marketplace systems often evaluate new policies through experiments that randomize treatment across operational units (e.g., geographies, regions, or clusters) over many time periods. In these settings, standard A/B testing can be inefficient or unreliable due to a limited number of units, substantial cross-unit heterogeneity, non-stationarity, and potential carryover across periods. We propose Sequentially-Rerandomized Switchback Experiments (SRSB), a new experimental design that helps mitigate these challenges. SRSB re-randomizes treatment at each time period such as to enforce balance on pre-specified prognostic variables constructed from past observations. In the absence of carryover, SRSB improves precision by leveraging temporal dependence through balancing lagged outcomes and covariates; we develop finite-sample randomization inference under a sharp null as well as asymptotic inference as the number of periods grows. We then extend SRSB to settings with first-order carryover and introduce a blocked SRSB variant that rerandomizes within strata defined by the previous treatment to form stable and comparable "stay" groups. Extensive simulations demonstrate the practical gains and robustness of SRSB relative to standard switchback designs.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes Sequentially-Rerandomized Switchback Experiments (SRSB) for multi-period policy evaluation on online platforms. SRSB re-randomizes treatment assignments each period to enforce balance on pre-specified prognostic variables constructed from past observations. In the absence of carryover, it improves precision by leveraging temporal dependence through balancing lagged outcomes and covariates; finite-sample randomization inference under a sharp null and asymptotic inference as the number of periods grows are developed. The design is extended to first-order carryover via a blocked SRSB variant that rerandomizes within strata defined by the previous treatment to form stable groups. Extensive simulations are used to demonstrate gains relative to standard switchback designs.
Significance. If the inference procedures hold, SRSB provides a practical way to increase precision in switchback experiments with temporal dependence, limited units, and heterogeneity while retaining randomization-based inference. The dual finite-sample and asymptotic guarantees, together with the carryover extension, would strengthen the toolkit for marketplace and platform experimentation.
major comments (2)
- [Abstract / finite-sample RI section] The central claim of exact finite-sample randomization inference under the sharp null (Abstract) is load-bearing. Sequential balancing on lagged outcomes, which are functions of prior treatments and responses, conditions the current-period assignment on quantities that depend on potential outcomes. This can restrict the support of the realized randomization distribution in a non-uniform way and introduce correlation with the potential outcomes, violating the exchangeability required for exact type-I-error control of the randomization test.
- [Carryover extension] The blocked-strata extension for first-order carryover (Abstract) inherits the same conditioning issue and adds dependence induced by the previous-treatment strata. The manuscript must demonstrate that the conditional randomization distribution remains independent of the potential outcomes under the sharp null; otherwise the exactness claim does not extend to this variant.
minor comments (2)
- [Design section] Provide explicit algorithmic pseudocode or a step-by-step description of how the sequential balancing is implemented without violating the pre-specified randomization probabilities.
- [Simulations] The simulation section would benefit from tabulated coverage rates and power comparisons under both the sharp null and local alternatives, with clear statements of the data-generating processes and any exclusion rules.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive review. The points raised about the exact finite-sample randomization inference are central, and we address them directly below. We maintain that the claims hold under the sharp null due to invariance of outcomes, but will revise the manuscript to clarify this reasoning explicitly.
read point-by-point responses
-
Referee: [Abstract / finite-sample RI section] The central claim of exact finite-sample randomization inference under the sharp null (Abstract) is load-bearing. Sequential balancing on lagged outcomes, which are functions of prior treatments and responses, conditions the current-period assignment on quantities that depend on potential outcomes. This can restrict the support of the realized randomization distribution in a non-uniform way and introduce correlation with the potential outcomes, violating the exchangeability required for exact type-I-error control of the randomization test.
Authors: Under the sharp null of no treatment effect whatsoever, potential outcomes are identical for every unit regardless of treatment history. Consequently, all lagged outcomes and derived prognostic variables are fixed quantities that do not vary with the realized treatment sequence. The sequential balancing therefore conditions only on these invariant observed values, so the randomization distribution at each step remains fully specified and independent of the (fixed) potential outcomes. This restores the exchangeability needed for exact type-I error control of the randomization test. We will add a short clarifying subsection in the finite-sample inference section that states this invariance explicitly and shows why the support of the randomization distribution is unaffected. revision: partial
-
Referee: [Carryover extension] The blocked-strata extension for first-order carryover (Abstract) inherits the same conditioning issue and adds dependence induced by the previous-treatment strata. The manuscript must demonstrate that the conditional randomization distribution remains independent of the potential outcomes under the sharp null; otherwise the exactness claim does not extend to this variant.
Authors: The same invariance argument applies to the blocked SRSB variant. Under the sharp null, outcomes are unaffected by prior treatment, so the strata defined by the previous treatment are formed solely by the design; the outcome values inside each stratum remain fixed and identical across units. Re-randomization within strata therefore proceeds according to a conditional distribution that is still independent of the potential outcomes. We will insert a brief proof sketch immediately after the description of the blocked design showing that the conditional randomization measure under the sharp null does not depend on the potential outcomes. revision: partial
Circularity Check
No significant circularity; design and inference are self-contained via explicit randomization rules
full rationale
The paper defines SRSB through explicit, pre-specified randomization rules that balance prognostic variables constructed from past observations without fitting parameters to the target estimands. Finite-sample randomization inference under the sharp null is derived directly from the induced randomization distribution, and asymptotic results follow from standard period-growth arguments. No load-bearing step reduces by construction to a fitted input, self-citation chain, or renamed ansatz; the central claims remain independent of the outcomes they target.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Treatment can be re-randomized each period while maintaining a valid randomization distribution
- domain assumption Prognostic variables from past observations are available and balanceable without bias
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
SRSB re-randomizes treatment at each time period such as to enforce balance on pre-specified prognostic variables constructed from past observations.
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We develop finite-sample randomization inference under a sharp null as well as asymptotic inference as the number of periods grows.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
For allj≥2, tj −t j−1 ≥2
-
[2]
Assume that V 2 J /E[V 2 J ] P− →1 asT→ ∞
Define the blockwise predictable variance V 2 J = JX j=1 E tj+1 −1X t=tj +1 (bτt −τ t) 2 Ftj −1 . Assume that V 2 J /E[V 2 J ] P− →1 asT→ ∞
-
[3]
Z2 j ν2 j I(|Z j| ≥ϵ) # ν2 j ≤max 1≤j≤JT E
Let a2 t = ∥bτt −τ t∥2 2 S2 T = Var(bτt) S2 T . Assume: JX j=2 a2 tj →0, TX t=2 a2 t =O(1),max 1≤j≤J tj+1 −1X t=tj +1 a2 t →0.(11) 28 Then we have (T−1)(bτ−¯τ) ST d− →N(0,1). Most conditions are similar to Theorem 2 in the main text, except the third condition. Equation (11) ensures the contribution from the first time period of each block is negligible, ...
work page 1977
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.