Is Productivity Advantage of Cities Really Down To Mean and Variance?

Andrea Sy; Vladislav Morozov

arxiv: 2604.13188 · v1 · submitted 2026-04-14 · 💰 econ.EM · stat.AP

Is Productivity Advantage of Cities Really Down To Mean and Variance?

Vladislav Morozov , Andrea Sy This is my paper

Pith reviewed 2026-05-10 13:31 UTC · model grok-4.3

classification 💰 econ.EM stat.AP

keywords TFP distributionsagglomerationfirm selectionproductivity decompositionsSpanish firmslocation-scale shiftsnoisy estimates

0 comments

The pith

TFP distributions in denser and less dense areas are identical up to mean and variance shifts.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether total factor productivity distributions for firms in high-density versus low-density locations are the same except for shifts in their average level and spread, plus a possible cutoff of the lowest values. This limited form of equivalence is the key assumption behind methods that split observed city productivity advantages into agglomeration effects versus selection of better firms. Using Spanish administrative records and statistical tools adjusted for measurement error in productivity estimates, the analysis confirms the distributions align once mean and variance are allowed to differ. It also finds that the variance term adds little beyond the mean for explaining patterns across sectors.

Core claim

TFP distributions are statistically identical up to mean, variance, and left-tail truncation parameters, validating the use of such productivity decompositions. Using only the mean and variance is sufficient to capture differences for all sectors. Accordingly, the productivity advantage of cities may be entirely due to agglomeration rather than stronger selection.

What carries the argument

The location-scale-truncation equivalence assumption for TFP distributions, tested with econometric methods adapted to noisy firm-level estimates.

If this is right

City productivity advantages can be attributed entirely to agglomeration economies rather than firm selection.
The standard decomposition approach separating these two channels is empirically supported.
Mean and variance alone describe productivity differences across all sectors without needing truncation.
Policy efforts should target agglomeration channels instead of selection mechanisms.
The same validation approach extends to differences in worker skill distributions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If mean and variance capture the differences, models that track only these two moments may replace full nonparametric comparisons in related settings.
Repeating the checks on data from other countries would test whether the pattern holds beyond the Spanish sample.
The framework could be used to examine productivity distributions across other dimensions such as industry type or firm age.
Similar tests on skill distributions might clarify whether worker quality advantages in cities also reduce to location-scale shifts.

Load-bearing premise

The adapted econometric methods for noisy TFP estimates correctly recover the underlying distributions and do not introduce artifacts that would falsely support the mean-variance-truncation restriction.

What would settle it

A finding that TFP distributions differ substantially across density levels even after allowing for mean and variance shifts and left truncation in any sector or subsample.

Figures

Figures reproduced from arXiv: 2604.13188 by Andrea Sy, Vladislav Morozov.

read the original abstract

Firms in denser areas are more productive, a pattern attributed to agglomeration economies and firm selection. To disentangle these two channels, the popular approach of Combes et al. (2012, ECTA) critically assumes that total factor productivity (TFP) distributions between denser and less dense areas are the same up to mean, variance, and left-tail truncation. We empirically validate this assumption using Spanish administrative firm-level data and recent econometric methods adapted to noisy TFP estimates. Our results find that TFP distributions are indeed statistically identical up to these parameters, validating the use of such productivity decompositions. Furthermore, using only the mean and variance is sufficient to capture differences for all sectors. Accordingly, the productivity advantage of cities may be entirely due to agglomeration rather than stronger selection, suggesting that policymakers should focus on policies targeting agglomeration. Finally, our approach extends to related contexts like differences in worker skill distributions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a direct empirical check on the Combes et al. distributional assumption with Spanish firm data, but the tests' ability to rule out higher-moment differences under realistic TFP noise remains unshown.

read the letter

The core result is that TFP distributions across density levels look the same once you allow for shifts in mean and variance plus left truncation. The authors adapt existing deconvolution-style methods to handle estimated TFP and apply them to Spanish administrative records. They also report that mean and variance alone capture the differences across sectors, which would let researchers drop more complicated selection corrections in many urban productivity studies. That is the useful part: it supplies a concrete data point on an assumption that a lot of applied work takes as given. Credit for doing the check on real micro data rather than just asserting it holds. The policy takeaway follows directly from the finding: if selection is not the driver, then agglomeration-focused policies get more weight. The soft spot is the power question. The abstract and stress-test note both flag that the adapted procedures for noisy TFP lack reported Monte Carlo checks or power calculations against alternatives like differing skewness. If measurement error in TFP flattens higher moments, a non-rejection could be mechanical rather than informative. Without those diagnostics in the main text, the validation is suggestive but not decisive. The paper is aimed at applied urban and spatial economists who use the Combes decomposition or similar mean-variance-truncation setups. Anyone running firm-level productivity regressions across locations would want to see whether the assumption travels to Spanish data. It is worth sending to referees because the question is central and the exercise is falsifiable; the noise-handling details and power evidence are exactly what a review can tighten.

Referee Report

1 major / 2 minor

Summary. The paper empirically validates the key identifying assumption in Combes et al. (2012) that firm TFP distributions in denser versus less-dense Spanish locations are identical up to location, scale, and left-tail truncation. Using administrative firm-level data and adapted econometric procedures for noisy TFP estimates, it reports that the assumption holds statistically, that mean and variance alone suffice for all sectors, and therefore that observed city productivity premia can be attributed entirely to agglomeration rather than differential selection. The approach is also positioned as extensible to other distributional comparisons such as worker skills.

Significance. If the validation survives scrutiny of the test procedures, the paper would supply useful empirical grounding for a decomposition method widely employed in urban economics, permitting cleaner separation of agglomeration and selection channels. The additional claim that higher moments add no explanatory power simplifies application of the framework, while the extension to skill distributions offers a methodological template for related questions. Reliance on administrative data rather than survey samples is a clear strength.

major comments (1)

[Econometric Methods] The section describing the adapted econometric methods (likely the deconvolution or moment-based tests for distributional equivalence) contains no Monte Carlo evidence or power calculations under realistic TFP measurement-error levels. Because the central claim rests on non-rejection of the null that distributions are identical up to mean, variance, and truncation, low power against alternatives with differing skewness or kurtosis would produce false support for the Combes et al. restriction; this issue is load-bearing for the abstract's validation statement and the results in the empirical section.

minor comments (2)

[Data and Descriptive Statistics] Table 1 (or the corresponding data-description table) should report the exact number of firms and observations per density quartile together with the TFP estimation procedure and any trimming rules applied before the distributional tests.
[Abstract] The abstract states that 'using only the mean and variance is sufficient to capture differences for all sectors' but does not indicate whether this conclusion is based on formal tests of higher-moment restrictions or on informal comparison of fitted distributions.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their constructive comments and positive assessment of the paper's contribution to validating the key assumption in Combes et al. (2012). We address the major comment on the econometric methods below.

read point-by-point responses

Referee: [Econometric Methods] The section describing the adapted econometric methods (likely the deconvolution or moment-based tests for distributional equivalence) contains no Monte Carlo evidence or power calculations under realistic TFP measurement-error levels. Because the central claim rests on non-rejection of the null that distributions are identical up to mean, variance, and truncation, low power against alternatives with differing skewness or kurtosis would produce false support for the Combes et al. restriction; this issue is load-bearing for the abstract's validation statement and the results in the empirical section.

Authors: We acknowledge the validity of this concern. The manuscript currently does not include Monte Carlo simulations or explicit power calculations for the adapted tests under measurement error. To address this, we will incorporate a new subsection in the revised version that presents Monte Carlo evidence. The simulations will be designed to reflect the levels of TFP measurement error observed in our Spanish administrative data and will evaluate the power of the tests to detect differences in higher moments (skewness and kurtosis) when the null of equivalence up to mean, variance, and truncation holds. This addition will provide stronger support for our empirical validation and the claims in the abstract. revision: yes

Circularity Check

0 steps flagged

Empirical validation of external assumption shows no circularity

full rationale

The paper's central claim is an empirical statistical test on Spanish administrative firm-level data: TFP distributions are identical up to mean, variance, and left truncation after adapting deconvolution-style methods for noisy TFP estimates. This test is performed on external data rather than being recovered from the paper's own fitted parameters or definitions. The assumption being validated originates in Combes et al. (2012), an independent prior paper with no author overlap. No equations or procedures reduce by construction to the target result; the methods are adapted but the outcome (non-rejection) is a data-driven finding, not a renaming or self-definition. The paper is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The central claim rests on the validity of the 2012 distributional assumption and on the correctness of the adapted econometric procedures for noisy TFP. No new free parameters, axioms, or invented entities are introduced in the abstract.

pith-pipeline@v0.9.0 · 5448 in / 987 out tokens · 36674 ms · 2026-05-10T13:31:54.607327+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages

[1]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION format.date year duplicate empty "emp...

work page
[2]

D. A. Ackerberg, K. Caves, and G. Frazer. Identification Properties of Recent Production Function Estimators . Econometrica, 83 0 (6): 0 2411--2451, 2015. ISSN 0012-9682. doi:10.3982/ecta13408

work page doi:10.3982/ecta13408 2015
[3]

Almunia, D

M. Almunia, D. López-Rodríguez, and E. Moral-Benito. Evaluating the Macro-Representativeness of a Firm-Level Database: An Application for the Spanish Economy . Occasional Papers 1802, Banco de España, Mar. 2018

work page 2018
[4]

Microdata on Individual Enterprises , 2026

Banco De Espa \ n a . Microdata on Individual Enterprises , 2026. doi:10.48719/BELAB.CBI9523_01

work page doi:10.48719/belab.cbi9523_01 2026
[5]

Barras, P

L. Barras, P. Gagliardini, and O. Scaillet. Skill, Scale , and Value Creation in the Mutual Fund Industry . The Journal of Finance, 77 0 (1): 0 601--638, 2022. ISSN 0022-1082, 1540-6261. doi:10.1111/jofi.13096

work page doi:10.1111/jofi.13096 2022
[6]

P. P. Combes and L. Gobillon. The Empirics of Agglomeration Economies . In Handbook of Regional and Urban Economics , volume 5, pages 247--348. Elsevier, 2015. ISBN 978-0-444-59533-1. doi:10.1016/B978-0-444-59517-1.00005-2

work page doi:10.1016/b978-0-444-59517-1.00005-2 2015
[7]

P. P. Combes, G. Duranton, L. Gobillon, D. Puga, and S. Roux. The Productivity Advantages of Large Cities: Distinguishing Agglomeration From Firm Selection . Econometrica, 80 0 (6): 0 2543--2594, 2012. ISSN 0012-9682. doi:10.3982/ecta8442

work page doi:10.3982/ecta8442 2012
[8]

de la Roca and D

J. de la Roca and D. Puga. Learning By Working in Big Cities . Review of Economic Studies, 84 0 (1): 0 106--142, 2017. ISSN 1467937X. doi:10.1093/restud/rdw031

work page doi:10.1093/restud/rdw031 2017
[9]

Dhaene and K

G. Dhaene and K. Jochmans. Split-Panel Jackknife Estimation of Fixed-Effect Models . The Review of Economic Studies, 82 0 (3): 0 991--1030, 2015. doi:10.1093/restud/rdv007

work page doi:10.1093/restud/rdv007 2015
[10]

Duranton and D

G. Duranton and D. Puga. The Economics of Urban Density . Journal of Economic Perspectives, 34 0 (3): 0 3--26, Aug. 2020. ISSN 0895-3309. doi:10.1257/jep.34.3.3

work page doi:10.1257/jep.34.3.3 2020
[11]

J. Durbin. Kolmogorov- Smirnov Tests When Parameters Are Estimated With Applications to Tests of Exponentiality and Tests on Spacings . Biometrika, 62 0 (1): 0 5--22, 1975. ISSN 0006-3444, 1464-3510. doi:10.1093/biomet/62.1.5

work page doi:10.1093/biomet/62.1.5 1975
[12]

C. Gaubert. Firm Sorting and Agglomeration . American Economic Review, 108 0 (11): 0 3117--3153, Nov. 2018. ISSN 0002-8282. doi:10.1257/aer.20150361

work page doi:10.1257/aer.20150361 2018
[13]

Gonz\'alez and A

B. Gonz\'alez and A. Sy. Credit Without Collateral: Firm Size and Debt Structure . Technical report, 2025

work page 2025
[14]

Hsieh and P

C.-T. Hsieh and P. J. Klenow. Misallocation and Manufacturing TFP in China and India . The Quarterly Journal of Economics, 124 0 (4): 0 1403--1448, None 2009

work page 2009
[15]

Jochmans and M

K. Jochmans and M. Weidner. Inference on a Distribution from Noisy Draws . Econometric Theory, 40 0 (1): 0 60--97, 2024. doi:10.1017/S0266466622000378

work page doi:10.1017/s0266466622000378 2024
[16]

Kalemli-Ozcan, B

S. Kalemli-Ozcan, B. Sorensen, C. Villegas-Sanchez, V. Volosovych, and S. Yesiltas. How to Construct Nationally Representative Firm Level Data from the Orbis Global Database: New Facts on SMEs and Aggregate Implications for Industry Concentration . American Economic Journal: Macroeconomics, 16 0 (2): 0 353--374, April 2024

work page 2024
[17]

H. J. Khamis. The -Corrected Kolmogorov-Smirnov Test with Estimated Parameters . Journal of Nonparametric Statistics, 2 0 (1): 0 17--27, Jan. 1992. ISSN 1048-5252, 1029-0311. doi:10.1080/10485259208832539

work page doi:10.1080/10485259208832539 1992
[18]

P. C. Melo, D. J. Graham, and R. B. Noland. A Meta-Analysis of Estimates of Urban Agglomeration Economies . Regional Science and Urban Economics, 39 0 (3): 0 332--342, 2009. ISSN 01660462. doi:10.1016/j.regsciurbeco.2008.12.002

work page doi:10.1016/j.regsciurbeco.2008.12.002 2009
[19]

V. Morozov. Inference on Extreme Quantiles of Unobserved Individual Heterogeneity . Econometric Theory, pages 1--52, Jan. 2026. ISSN 0266-4666, 1469-4360. doi:10.1017/S0266466625100315

work page doi:10.1017/s0266466625100315 2026
[20]

Chapter 18 -

D. Neumark and H. Simpson. Place- Based Policies . In Handbook of Regional and Urban Economics , volume 5, pages 1197--1287. Elsevier, 2015. ISBN 978-0-444-59533-1. doi:10.1016/B978-0-444-59531-7.00018-1

work page doi:10.1016/b978-0-444-59531-7.00018-1 2015
[21]

Journal of Econometrics , author =

R. Okui and T. Yanagi. Panel Data Analysis with Heterogeneous Dynamics . Journal of Econometrics, 212 0 (2): 0 451--475, 2019. ISSN 0304-4076. doi:10.1016/j.jeconom.2019.04.036. URL 10.1016/j.jeconom.2019.04.036

work page doi:10.1016/j.jeconom.2019.04.036 2019

[1] [1]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION format.date year duplicate empty "emp...

work page

[2] [2]

D. A. Ackerberg, K. Caves, and G. Frazer. Identification Properties of Recent Production Function Estimators . Econometrica, 83 0 (6): 0 2411--2451, 2015. ISSN 0012-9682. doi:10.3982/ecta13408

work page doi:10.3982/ecta13408 2015

[3] [3]

Almunia, D

M. Almunia, D. López-Rodríguez, and E. Moral-Benito. Evaluating the Macro-Representativeness of a Firm-Level Database: An Application for the Spanish Economy . Occasional Papers 1802, Banco de España, Mar. 2018

work page 2018

[4] [4]

Microdata on Individual Enterprises , 2026

Banco De Espa \ n a . Microdata on Individual Enterprises , 2026. doi:10.48719/BELAB.CBI9523_01

work page doi:10.48719/belab.cbi9523_01 2026

[5] [5]

Barras, P

L. Barras, P. Gagliardini, and O. Scaillet. Skill, Scale , and Value Creation in the Mutual Fund Industry . The Journal of Finance, 77 0 (1): 0 601--638, 2022. ISSN 0022-1082, 1540-6261. doi:10.1111/jofi.13096

work page doi:10.1111/jofi.13096 2022

[6] [6]

P. P. Combes and L. Gobillon. The Empirics of Agglomeration Economies . In Handbook of Regional and Urban Economics , volume 5, pages 247--348. Elsevier, 2015. ISBN 978-0-444-59533-1. doi:10.1016/B978-0-444-59517-1.00005-2

work page doi:10.1016/b978-0-444-59517-1.00005-2 2015

[7] [7]

P. P. Combes, G. Duranton, L. Gobillon, D. Puga, and S. Roux. The Productivity Advantages of Large Cities: Distinguishing Agglomeration From Firm Selection . Econometrica, 80 0 (6): 0 2543--2594, 2012. ISSN 0012-9682. doi:10.3982/ecta8442

work page doi:10.3982/ecta8442 2012

[8] [8]

de la Roca and D

J. de la Roca and D. Puga. Learning By Working in Big Cities . Review of Economic Studies, 84 0 (1): 0 106--142, 2017. ISSN 1467937X. doi:10.1093/restud/rdw031

work page doi:10.1093/restud/rdw031 2017

[9] [9]

Dhaene and K

G. Dhaene and K. Jochmans. Split-Panel Jackknife Estimation of Fixed-Effect Models . The Review of Economic Studies, 82 0 (3): 0 991--1030, 2015. doi:10.1093/restud/rdv007

work page doi:10.1093/restud/rdv007 2015

[10] [10]

Duranton and D

G. Duranton and D. Puga. The Economics of Urban Density . Journal of Economic Perspectives, 34 0 (3): 0 3--26, Aug. 2020. ISSN 0895-3309. doi:10.1257/jep.34.3.3

work page doi:10.1257/jep.34.3.3 2020

[11] [11]

J. Durbin. Kolmogorov- Smirnov Tests When Parameters Are Estimated With Applications to Tests of Exponentiality and Tests on Spacings . Biometrika, 62 0 (1): 0 5--22, 1975. ISSN 0006-3444, 1464-3510. doi:10.1093/biomet/62.1.5

work page doi:10.1093/biomet/62.1.5 1975

[12] [12]

C. Gaubert. Firm Sorting and Agglomeration . American Economic Review, 108 0 (11): 0 3117--3153, Nov. 2018. ISSN 0002-8282. doi:10.1257/aer.20150361

work page doi:10.1257/aer.20150361 2018

[13] [13]

Gonz\'alez and A

B. Gonz\'alez and A. Sy. Credit Without Collateral: Firm Size and Debt Structure . Technical report, 2025

work page 2025

[14] [14]

Hsieh and P

C.-T. Hsieh and P. J. Klenow. Misallocation and Manufacturing TFP in China and India . The Quarterly Journal of Economics, 124 0 (4): 0 1403--1448, None 2009

work page 2009

[15] [15]

Jochmans and M

K. Jochmans and M. Weidner. Inference on a Distribution from Noisy Draws . Econometric Theory, 40 0 (1): 0 60--97, 2024. doi:10.1017/S0266466622000378

work page doi:10.1017/s0266466622000378 2024

[16] [16]

Kalemli-Ozcan, B

S. Kalemli-Ozcan, B. Sorensen, C. Villegas-Sanchez, V. Volosovych, and S. Yesiltas. How to Construct Nationally Representative Firm Level Data from the Orbis Global Database: New Facts on SMEs and Aggregate Implications for Industry Concentration . American Economic Journal: Macroeconomics, 16 0 (2): 0 353--374, April 2024

work page 2024

[17] [17]

H. J. Khamis. The -Corrected Kolmogorov-Smirnov Test with Estimated Parameters . Journal of Nonparametric Statistics, 2 0 (1): 0 17--27, Jan. 1992. ISSN 1048-5252, 1029-0311. doi:10.1080/10485259208832539

work page doi:10.1080/10485259208832539 1992

[18] [18]

P. C. Melo, D. J. Graham, and R. B. Noland. A Meta-Analysis of Estimates of Urban Agglomeration Economies . Regional Science and Urban Economics, 39 0 (3): 0 332--342, 2009. ISSN 01660462. doi:10.1016/j.regsciurbeco.2008.12.002

work page doi:10.1016/j.regsciurbeco.2008.12.002 2009

[19] [19]

V. Morozov. Inference on Extreme Quantiles of Unobserved Individual Heterogeneity . Econometric Theory, pages 1--52, Jan. 2026. ISSN 0266-4666, 1469-4360. doi:10.1017/S0266466625100315

work page doi:10.1017/s0266466625100315 2026

[20] [20]

Chapter 18 -

D. Neumark and H. Simpson. Place- Based Policies . In Handbook of Regional and Urban Economics , volume 5, pages 1197--1287. Elsevier, 2015. ISBN 978-0-444-59533-1. doi:10.1016/B978-0-444-59531-7.00018-1

work page doi:10.1016/b978-0-444-59531-7.00018-1 2015

[21] [21]

Journal of Econometrics , author =

R. Okui and T. Yanagi. Panel Data Analysis with Heterogeneous Dynamics . Journal of Econometrics, 212 0 (2): 0 451--475, 2019. ISSN 0304-4076. doi:10.1016/j.jeconom.2019.04.036. URL 10.1016/j.jeconom.2019.04.036

work page doi:10.1016/j.jeconom.2019.04.036 2019