Finite Population Sampling as n to N: Empirical Evidence for the Transition from Inference to Accuracy

Mike Crowhurst

arxiv: 2605.18691 · v1 · pith:7EYD3W6Enew · submitted 2026-05-18 · 📊 stat.ME · stat.CO

Finite Population Sampling as n to N: Empirical Evidence for the Transition from Inference to Accuracy

Mike Crowhurst This is my paper

Pith reviewed 2026-05-20 08:13 UTC · model grok-4.3

classification 📊 stat.ME stat.CO

keywords finite population samplingsampling fractionrandomization distributionsample meannumerical precisionadministrative datasensor datacentral limit theorem

0 comments

The pith

Sampling variability in finite populations becomes negligible well before the sample reaches the full population size, after which computational precision determines estimator accuracy.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper constructs two finite populations with known parameters and performs repeated sampling across a range of fractions from n to N. It tracks the randomization distribution of the sample mean to show how variability shrinks rapidly as the sampling fraction increases. The work is motivated by modern data sources that cover large portions of real populations, such as administrative records and sensor systems. A sympathetic reader would care because standard statistical inference assumes large or infinite populations where sampling error persists, but here that assumption breaks down early. Once variability from sampling is minimized, the paper finds that remaining errors stem from numerical precision in calculations rather than randomness.

Core claim

The paper establishes that as the sampling fraction f = n/N approaches 1, the randomization distribution of the sample mean narrows such that sampling variability becomes negligible well before full enumeration. At that point, deviations of the sample mean from the population mean are attributable mainly to numerical precision and the structure of the computational methods used, rather than to the random sampling process itself.

What carries the argument

The randomization distribution of the sample mean obtained through repeated sampling from constructed finite populations at varying sampling fractions f = n/N.

Load-bearing premise

That the constructed finite populations and the repeated-sampling experiments adequately represent the behavior in actual high-coverage administrative and sensor data streams.

What would settle it

If repeated sampling experiments on similar finite populations showed that the variability of the sample mean did not become negligible until the sampling fraction was very close to 1, or if deviations at high fractions were still dominated by sampling rather than computation.

Figures

Figures reproduced from arXiv: 2605.18691 by Mike Crowhurst.

**Figure 1.** Figure 1: Randomization distributions with n’s of 30, 100, 1000, and 100,000 (generated by A.I.) This question is no longer purely theoretical as advances in data collection, storage, and computational capacity have led to widespread use of systems that generate near-enumeration data rather than small random samples. Cochran (Cochran, 1977) and others (Lohr, 2021) (Sarmda & . C.-E., 1992) note that: 𝑉𝑎𝑟ሺ𝑦തሻ ≈ ቀ1 − 𝑛… view at source ↗

**Figure 2.** Figure 2: Estimator Deviation vs. Sampling Fraction [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗

read the original abstract

The Central Limit Theorem provides a foundation for inferential statistics and hypothesis testing. It describes how standardized statistics behave under repeated sampling from large populations. However, if the size of the sample (n) becomes so large that it approaches the size of the population (N), sampling variability becomes very small, and standard errors and margins of error both approach zero. The purpose of this project was to investigate the behavior of estimators as the sampling fraction (f = n/N) approaches 1, motivated by modern data streams from administrative records, transaction logs, sensor systems, and institutional databases that capture large portions of finite populations. We constructed two finite populations with known parameters and drew repeated samples across a range of sampling fractions. We then examined the resulting randomization distributions of the sample mean to understand how sampling variability collapses. Additional experiments were conducted using various CPU- and GPU-based methods to evaluate the deviation of the sample mean from the defined population mean under different computational conditions. The results confirm that sampling variability diminishes as expected under finite population theory and becomes negligible well before full enumeration is reached. Once sampling variability is minimized, remaining deviations in estimators are primarily related to numerical precision and computational structure rather than random sampling. These findings support a reassessment of inferential assumptions in high-coverage, large-scale data settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript presents an empirical investigation into the behavior of statistical estimators, specifically the sample mean, as the sampling fraction f = n/N approaches 1 in finite populations. Motivated by high-coverage data from administrative records and sensor systems, the authors construct two finite populations with known parameters, perform repeated sampling across a range of fractions, and analyze the randomization distributions. They also explore computational methods to assess deviations from the population mean. The key finding is that sampling variability collapses as predicted by finite population theory, becoming negligible before full enumeration, after which residual errors are attributed to numerical precision and computational structure.

Significance. If substantiated with sufficient experimental details and checks on representativeness, this work could contribute to understanding the practical limits of classical inference in modern high-coverage data settings. It offers simulation evidence consistent with finite population correction factors and highlights a potential shift toward accuracy-focused evaluation once sampling variability is minimal. The computational experiments add relevance to current data environments, though the absence of quantitative parameters and real-data validation currently constrains broader impact.

major comments (3)

Abstract: The description of the two constructed finite populations and repeated-sampling procedure provides no quantitative details on population sizes N, sampling fractions f or corresponding n values, number of replications, or the exact sampling design (e.g., SRS without replacement). These omissions are load-bearing because the central claim that variability becomes negligible well before full enumeration cannot be verified or assessed for rate without them.
Motivation and experimental design paragraphs: The claim that remaining deviations are due to numerical precision rather than sampling requires explicit isolation of computational effects (e.g., via controlled comparisons to exact arithmetic or finite-population variance formulas). The current setup on two constructed populations does not demonstrate such isolation and risks confounding with population construction choices.
Motivation paragraph: The assertion that the observed collapse supports reassessment of inferential assumptions in administrative/sensor streams rests on the untested assumption that the two constructed populations capture relevant features such as dependence or heterogeneity; no evidence or sensitivity checks are provided to establish representativeness.

minor comments (2)

Clarify the precise definition and range of sampling fractions examined and ensure consistent use of notation for f = n/N across text and any figures.
The abstract's reference to 'various CPU- and GPU-based methods' would benefit from a brief description of the specific computational conditions tested.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their insightful comments and the opportunity to improve our manuscript. We address each of the major comments point by point below.

read point-by-point responses

Referee: Abstract: The description of the two constructed finite populations and repeated-sampling procedure provides no quantitative details on population sizes N, sampling fractions f or corresponding n values, number of replications, or the exact sampling design (e.g., SRS without replacement). These omissions are load-bearing because the central claim that variability becomes negligible well before full enumeration cannot be verified or assessed for rate without them.

Authors: We agree that the abstract should include these quantitative details to support verification of the claims. The full manuscript describes the population sizes, sampling fractions, replications, and sampling design in the experimental setup section. We will revise the abstract to incorporate this information, making the key parameters explicit for readers. revision: yes
Referee: Motivation and experimental design paragraphs: The claim that remaining deviations are due to numerical precision rather than sampling requires explicit isolation of computational effects (e.g., via controlled comparisons to exact arithmetic or finite-population variance formulas). The current setup on two constructed populations does not demonstrate such isolation and risks confounding with population construction choices.

Authors: The referee correctly identifies the importance of isolating computational effects. Our work includes experiments across different computational methods to show that deviations remain after sampling variability collapses. To better demonstrate isolation, we will add explicit comparisons using finite population variance formulas and controlled numerical precision tests in the revised manuscript. revision: yes
Referee: Motivation paragraph: The assertion that the observed collapse supports reassessment of inferential assumptions in administrative/sensor streams rests on the untested assumption that the two constructed populations capture relevant features such as dependence or heterogeneity; no evidence or sensitivity checks are provided to establish representativeness.

Authors: We recognize that the constructed populations are idealized to have known parameters and do not necessarily include all features of real data such as dependence or heterogeneity. The results illustrate the general behavior predicted by finite population theory. We will add a limitations section discussing representativeness and include sensitivity checks with varied population structures in the revision to strengthen the motivation. revision: partial

Circularity Check

0 steps flagged

No circularity: purely empirical simulation on constructed populations

full rationale

The paper conducts repeated-sampling simulations on two explicitly constructed finite populations to observe how randomization distributions of the sample mean behave as the sampling fraction approaches 1. All reported results (collapse of sampling variability, attribution of residuals to numerical precision) are direct outputs of these Monte Carlo experiments rather than any derived equations, fitted parameters renamed as predictions, or self-citation chains. No load-bearing steps invoke uniqueness theorems, ansatzes from prior work, or reductions of the central claim to its own inputs; the work is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The paper rests on the standard assumptions of finite population sampling theory (known population parameters, simple random sampling without replacement) without introducing new axioms, free parameters, or invented entities. No explicit fitting of constants is described in the abstract.

pith-pipeline@v0.9.0 · 5755 in / 1277 out tokens · 42046 ms · 2026-05-20T08:13:38.083398+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

5 extracted references · 5 canonical work pages

[1]

Cochran, W. (1977). Sampling Techniques (3rd ed.). Wiley. de Moivre, A. (1733). Approximatio ad summam terminorum binomii (a+b)^n in seriem expansi. Philosophical Transactions of the Royal Society, 37, 418-430

work page 1977
[2]

Feller, W. (1971). An Introduction to Probability Theory and Its Applications (2nd ed., V ol. 2). Wiley

work page 1971
[3]

Groves, R. M. (2011). Survey methodology. John Wiley & Sons. Jianfeng Chi, Y . T. (2020). Understanding and Mitigating Accuracy Disparity in Regression. ICLR submission

work page 2011
[4]

Lohr, S. (2021). Sampling: Design and Analysis (3rd ed.). Chapman & Hall/CRC

work page 2021
[5]

Lyapunov, A. (1901). Nouvelle forme du theoreme sur la limite de probabilite. Memoires del Academie Imperiale des Sciences de Saint Petersbourg, 12(5), 1-24. Sarmda, & . C.-E., S. B. (1992). Model Assistend Survey Sampling. Springer. AI-based tools were used for language refinement and for assistance in developing Python code used in the simulations. All ...

work page 1901

[1] [1]

Cochran, W. (1977). Sampling Techniques (3rd ed.). Wiley. de Moivre, A. (1733). Approximatio ad summam terminorum binomii (a+b)^n in seriem expansi. Philosophical Transactions of the Royal Society, 37, 418-430

work page 1977

[2] [2]

Feller, W. (1971). An Introduction to Probability Theory and Its Applications (2nd ed., V ol. 2). Wiley

work page 1971

[3] [3]

Groves, R. M. (2011). Survey methodology. John Wiley & Sons. Jianfeng Chi, Y . T. (2020). Understanding and Mitigating Accuracy Disparity in Regression. ICLR submission

work page 2011

[4] [4]

Lohr, S. (2021). Sampling: Design and Analysis (3rd ed.). Chapman & Hall/CRC

work page 2021

[5] [5]

Lyapunov, A. (1901). Nouvelle forme du theoreme sur la limite de probabilite. Memoires del Academie Imperiale des Sciences de Saint Petersbourg, 12(5), 1-24. Sarmda, & . C.-E., S. B. (1992). Model Assistend Survey Sampling. Springer. AI-based tools were used for language refinement and for assistance in developing Python code used in the simulations. All ...

work page 1901