Difference-in-differences with as few as two cross-sectional units -- A new perspective to the democracy-growth debate

Emmanuel Selorm Tsyawo; Gilles Koumou

arxiv: 2408.13047 · v5 · submitted 2024-08-23 · 💰 econ.EM

Difference-in-differences with as few as two cross-sectional units -- A new perspective to the democracy-growth debate

Gilles Koumou , Emmanuel Selorm Tsyawo This is my paper

Pith reviewed 2026-05-23 21:35 UTC · model grok-4.3

classification 💰 econ.EM

keywords difference-in-differencestemporal DiDunit-specific ATTdemocracy and growthBeninasymptotic normalityidentification test

0 comments

The pith

A temporal difference-in-differences estimator estimates unit-specific treatment effects with only two cross-sectional units and finds democratization increased Benin's average growth.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces the Temporal Difference-in-Differences estimator to estimate the effect of a treatment like democratization on a single unit's outcome using time-series variation and at least one control unit. It establishes asymptotic normality of the estimator under conditions including asymptotic parallel trends, limited anticipation, and temporal dependence. The approach also provides an identification test for parallel trends that works in post-treatment periods. In an application to Benin, the estimator indicates the economy would have been 6.4 percent smaller on average from 1993 to 2018 without democratization.

Core claim

The T-DiD estimator leverages temporal variation in the data to estimate unit-specific average treatment effects on the treated (ATT) with as few as two cross-sectional units. Under asymptotic parallel trends, limited anticipation, and temporal dependence conditions, the proposed DiD estimator is shown to be asymptotically normal. Provided at least two control units are available, the method is further complemented with an identification test that, unlike pre-trends tests, is more powerful and can detect violations of parallel trends in post-treatment periods. Empirical results using the DiD estimator suggest Benin's economy would have been 6.4% smaller on average over the 1993-2018 period.

What carries the argument

The Temporal Difference-in-Differences (T-DiD) estimator that uses temporal variation to identify unit-specific ATT.

If this is right

Unit-specific effects can be estimated without large panels of cross-sectional units.
The estimator is asymptotically normal, enabling standard inference.
An identification test can detect parallel trends violations after treatment.
The application shows a positive effect of democratization on economic growth in Benin.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the method holds, many single-country policy changes can be evaluated with minimal data requirements.
The test for identification could be used in other settings with few units to validate assumptions.
Results for Benin suggest re-evaluating democracy-growth links with country-specific methods rather than pooled panels.

Load-bearing premise

The asymptotic parallel trends condition holds for the temporal variation in the data.

What would settle it

Finding that the parallel trends assumption is violated in the post-treatment period for the Benin democratization study would invalidate the 6.4 percent estimate.

read the original abstract

Pooled panel analyses often mask heterogeneity in unit-specific treatment effects. This challenge, for example, crops up in studies of the impact of democracy on economic growth, where findings vary substantially due to differences in country composition. To address this challenge, this paper introduces the Temporal Difference-in-Differences (T-DiD) estimator that leverages temporal variation in the data to estimate unit-specific average treatment effects on the treated (ATT) with as few as two cross-sectional units. Under asymptotic parallel trends, limited anticipation, and temporal dependence conditions, the proposed DiD estimator is shown to be asymptotically normal. Provided at least two control units are available, the method is further complemented with an identification test that, unlike pre-trends tests, is more powerful and can detect violations of parallel trends in post-treatment periods. Empirical results using the DiD estimator suggest Benin's economy would have been 6.4% smaller on average over the 1993-2018 period had she not democratised.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

T-DiD gives unit-specific ATTs with N=2 by using time-series variation, but asymptotic parallel trends remains the load-bearing assumption and the Benin 6.4% result stands or falls with it.

read the letter

The paper's main contribution is the T-DiD estimator, which estimates unit-specific average treatment effects on the treated using only temporal variation when you have as few as two cross-sectional units. It shows asymptotic normality under asymptotic parallel trends, limited anticipation, and temporal dependence, and adds a post-treatment identification test that is claimed to be more powerful than standard pre-trends checks. The Benin application reports that the economy would have been 6.4% smaller without democratization from 1993-2018. That is the concrete new piece: a method aimed at the small-N problem that shows up often in cross-country growth work. It does a service by highlighting how pooled estimates can mask heterogeneity and by trying to deliver per-unit results without needing large samples. The stress-test note is right that the parallel trends condition does not get relaxed by fixing N at two; any fixed violation produces bias that does not average away, so the test can only detect departures after the fact. The empirical headline therefore inherits the same untested requirement. The paper is aimed at applied researchers who work with short panels in political economy or development and at methodologists who want to see how the temporal-variation approach and the new test perform in practice. A reader who cares about small-N identification would get something out of it. It deserves peer review because the problem it targets is common and the proposal is a clear methodological step, even though the identifying assumption will need careful scrutiny in revision.

Referee Report

3 major / 2 minor

Summary. The paper introduces the Temporal Difference-in-Differences (T-DiD) estimator to recover unit-specific average treatment effects on the treated (ATT) from temporal variation alone, applicable when the number of cross-sectional units N is as small as 2 and T is large. It states that the estimator is asymptotically normal under asymptotic parallel trends, limited anticipation, and temporal dependence conditions. The manuscript also proposes an identification test for parallel-trends violations that can be applied in post-treatment periods. In the empirical application, the T-DiD estimator is used to conclude that Benin’s economy would have been 6.4 percent smaller on average over 1993–2018 in the absence of democratization.

Significance. If the asymptotic results and the identification test are valid, the approach would allow unit-specific causal estimates in the small-N large-T environments typical of cross-country growth studies, addressing the heterogeneity problem that arises in pooled panel regressions of democracy on growth. The post-treatment identification test could strengthen the credibility of DiD designs where pre-trends tests are uninformative. The practical value, however, depends on whether the asymptotic parallel trends condition can be maintained at the rate required for √T-consistency when N is fixed at 2.

major comments (3)

[Abstract] Abstract and theoretical section: the asymptotic normality result is derived under the joint conditions of asymptotic parallel trends, limited anticipation, and temporal dependence, yet with N fixed at 2 any fixed (non-vanishing) violation of parallel trends produces a non-vanishing bias term that cannot be averaged away; the manuscript does not supply the rate condition on the violation or Monte Carlo evidence showing robustness at plausible violation rates.
[Empirical application] Empirical application (Benin results): the headline 6.4 percent figure is reported without standard errors, confidence intervals, or any indication of the underlying data sources, variable definitions, or sample construction, preventing assessment of whether the asymptotic parallel trends condition is plausible in the 1993–2018 period.
[Identification test section] Identification test: the claim that the proposed post-treatment test is “more powerful” than conventional pre-trends tests is stated without a formal power analysis, local-alternative derivation, or simulation study comparing size and power under the maintained temporal-dependence conditions.

minor comments (2)

[Abstract] The abstract introduces the acronym T-DiD without spelling out “Temporal Difference-in-Differences” on first use.
[Notation and definitions] Notation for the unit-specific ATT and the precise definition of the “asymptotic parallel trends” condition should be stated explicitly in the main text before the asymptotic normality theorem is invoked.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below and indicate planned revisions.

read point-by-point responses

Referee: [Abstract] Abstract and theoretical section: the asymptotic normality result is derived under the joint conditions of asymptotic parallel trends, limited anticipation, and temporal dependence, yet with N fixed at 2 any fixed (non-vanishing) violation of parallel trends produces a non-vanishing bias term that cannot be averaged away; the manuscript does not supply the rate condition on the violation or Monte Carlo evidence showing robustness at plausible violation rates.

Authors: We agree that with N fixed at 2, asymptotic normality requires the parallel-trends violation to vanish at a rate that makes the bias term o_p(T^{-1/2}). The manuscript states the asymptotic parallel trends assumption but does not derive the explicit rate or supply Monte Carlo evidence. We will add the required rate condition to the theoretical section and include Monte Carlo simulations demonstrating performance under plausible violation rates. revision: yes
Referee: [Empirical application] Empirical application (Benin results): the headline 6.4 percent figure is reported without standard errors, confidence intervals, or any indication of the underlying data sources, variable definitions, or sample construction, preventing assessment of whether the asymptotic parallel trends condition is plausible in the 1993–2018 period.

Authors: The referee is correct that the empirical section omits standard errors, confidence intervals, and full data documentation. We will revise the application to report standard errors and confidence intervals for the 6.4 percent estimate and to include complete details on data sources, variable definitions, and sample construction. revision: yes
Referee: [Identification test section] Identification test: the claim that the proposed post-treatment test is “more powerful” than conventional pre-trends tests is stated without a formal power analysis, local-alternative derivation, or simulation study comparing size and power under the maintained temporal-dependence conditions.

Authors: The manuscript claims greater power on the basis of post-treatment data availability, but we agree that no formal power analysis or simulation study is provided. We will either add a simulation study under the maintained temporal-dependence conditions or qualify the claim accordingly in the revision. revision: partial

Circularity Check

0 steps flagged

No circularity: T-DiD asymptotic normality derived under explicit assumptions

full rationale

The paper claims asymptotic normality of the unit-specific ATT estimator under the joint conditions of asymptotic parallel trends, limited anticipation, and temporal dependence (with N fixed, T→∞). This is a standard derivation from stated regularity conditions rather than a reduction by construction to fitted parameters, self-citations, or renamed inputs. The identification test is presented as an additional tool that can detect post-treatment violations but does not relax or replace the core assumptions. No self-definitional steps, fitted-input predictions, or load-bearing self-citations appear in the derivation chain. The Benin 6.4% figure is an application of the estimator, not a circular output.

Axiom & Free-Parameter Ledger

0 free parameters · 3 axioms · 0 invented entities

The central claim rests on three domain assumptions for identification and asymptotics; no free parameters or invented entities are mentioned in the abstract.

axioms (3)

domain assumption Asymptotic parallel trends condition
Invoked as the basis for identification of unit-specific ATT and asymptotic normality of the estimator.
domain assumption Limited anticipation
Required to ensure treatment effects are not anticipated before the policy change.
domain assumption Temporal dependence conditions
Needed for the asymptotic normality result.

pith-pipeline@v0.9.0 · 5708 in / 1431 out tokens · 20574 ms · 2026-05-23T21:35:20.940055+00:00 · methodology

Difference-in-differences with as few as two cross-sectional units -- A new perspective to the democracy-growth debate

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)