Is segregation encoded in urban form? An entropy-based analysis
Pith reviewed 2026-05-10 04:52 UTC · model grok-4.3
The pith
Built form encodes residential segregation in non-linear entropy patterns.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We find non-linear relationships between built-form entropy, income, and segregation: income levels and residential clustering increase toward both extremes of the entropy spectrum, with a stronger rise at the high-entropy end. This asymmetry suggests that high-entropy urban forms are associated with distinct spatial processes of segregation, including elite enclaving and incremental development in lower-income settlements, while low-entropy forms reflect more selective occupation shaped by planning and market filtering. Overall, the findings suggest that built form is more than a neutral backdrop, functioning as both affordance and signal of segregation.
What carries the argument
Built-form entropy (BFE), a Shannon entropy measure computed from building footprints and aligned with income data through regular spatial tessellation.
If this is right
- Income levels and residential clustering rise toward both low and high ends of the BFE spectrum.
- High-entropy urban forms associate with elite enclaving and incremental development in lower-income settlements.
- Low-entropy forms reflect selective occupation shaped by planning and market filtering.
- Built form functions as both affordance and signal of segregation.
- The association is asymmetric, with stronger effects at the high-entropy end.
Where Pith is reading between the lines
- The same entropy-segregation pattern could be tested in other large cities to check if it is a general feature of urban growth.
- Urban design policies might target intermediate entropy levels to reduce the conditions that support concentrated income groups.
- The framework could be extended to track how changes in building form over time correspond to shifts in segregation.
- It links physical morphology directly to social outcomes, suggesting morphology metrics could complement traditional economic models of inequality.
Load-bearing premise
That built-form entropy regimes associate with the spatial distribution of income groups and their local clustering in non-linear ways.
What would settle it
A re-analysis of Sao Paulo building and income data showing linear relationships or no association between BFE values and Gini or Moran’s I measures of income segregation.
Figures
read the original abstract
The footprints of residential segregation have long been documented, yet the role of urban form as both medium and manifestation of segregation remains under-specified. We investigate whether the configuration of the built fabric may encode residential segregation in its spatial structure, hypothesising that built-form entropy (BFE) regimes are associated with the spatial distribution of income groups and their local clustering in non-linear ways. We examine this by quantifying BFE through a Shannon-based measure computed from building footprints, characterising income-based distributions using the Gini index and Moran's I, and placing both on a common spatial footing through a regular tessellation. Applying this framework to Sao Paulo, Latin America's largest city, we find non-linear relationships between BFE, income, and segregation: income levels and residential clustering increase toward both extremes of the entropy spectrum, with a stronger rise at the high-entropy end. This asymmetry suggests that high-entropy urban forms are associated with distinct spatial processes of segregation, including elite enclaving and incremental development in lower-income settlements, while low-entropy forms reflect more selective occupation shaped by planning and market filtering. Overall, the findings suggest that built form is more than a neutral backdrop, functioning as both affordance and signal of segregation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that built-form entropy (BFE), measured via Shannon entropy on building footprints, encodes residential segregation in its spatial structure. Using a regular tessellation to align BFE with income-based metrics (Gini index for distribution and Moran's I for clustering) in São Paulo, the authors report non-linear relationships: income levels and residential clustering increase toward both extremes of the BFE spectrum, with a stronger rise at the high-entropy end. This asymmetry is interpreted as evidence that high-entropy forms are associated with elite enclaving and incremental development, while low-entropy forms reflect selective occupation, positioning built form as both affordance and signal of segregation rather than a neutral backdrop.
Significance. If the reported non-linear associations prove robust, the work would be significant for socio-physics and urban studies by providing a quantitative demonstration that physical urban morphology is intertwined with socio-economic segregation processes. The entropy-based framework offers a fresh lens beyond standard indices, with potential implications for how urban form shapes and reflects inequality in rapidly growing cities like São Paulo.
major comments (2)
- [Abstract (framework description)] The central claim that BFE regimes are associated with income distribution and clustering in non-linear ways (stronger at high-entropy) is load-bearing on the regular tessellation used to place BFE and income metrics on a common grid. The abstract describes this alignment but reports no sensitivity analysis to cell size. If the non-linearity disappears or reverses at finer or coarser resolutions, the association may be an artifact of the arbitrary spatial aggregation rather than genuine encoding of segregation in urban form. This requires explicit robustness checks to support the hypothesis.
- [Abstract (results description)] The abstract states that non-linear relationships were found but supplies no statistical details such as regression coefficients, p-values, error bars, sample sizes per tessellation cell, or robustness to data exclusion rules. Without these, it is difficult to evaluate whether the asymmetry (stronger rise at high-entropy) is statistically reliable or driven by outliers in the São Paulo dataset.
minor comments (1)
- [Abstract] Clarify the exact definition and computation of BFE (e.g., how building footprints are categorized into types for the Shannon entropy calculation) to allow replication.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive report. The comments highlight important aspects of robustness and statistical transparency that we will address in the revision. Below we respond point by point to the major comments.
read point-by-point responses
-
Referee: [Abstract (framework description)] The central claim that BFE regimes are associated with income distribution and clustering in non-linear ways (stronger at high-entropy) is load-bearing on the regular tessellation used to place BFE and income metrics on a common grid. The abstract describes this alignment but reports no sensitivity analysis to cell size. If the non-linearity disappears or reverses at finer or coarser resolutions, the association may be an artifact of the arbitrary spatial aggregation rather than genuine encoding of segregation in urban form. This requires explicit robustness checks to support the hypothesis.
Authors: We agree that demonstrating invariance to tessellation scale is necessary to rule out aggregation artifacts. The primary analysis uses a 1 km regular grid chosen to balance spatial resolution with data coverage in São Paulo. In the revised manuscript we will add an explicit sensitivity analysis section (or appendix) that recomputes BFE, Gini, and Moran's I at 500 m and 2 km resolutions, reports the corresponding quadratic regression coefficients and p-values, and shows that the non-linear pattern and the stronger high-entropy asymmetry persist across scales. This will directly test the referee's concern. revision: yes
-
Referee: [Abstract (results description)] The abstract states that non-linear relationships were found but supplies no statistical details such as regression coefficients, p-values, error bars, sample sizes per tessellation cell, or robustness to data exclusion rules. Without these, it is difficult to evaluate whether the asymmetry (stronger rise at high-entropy) is statistically reliable or driven by outliers in the São Paulo dataset.
Authors: The full manuscript already contains quadratic regression results (including coefficients for the linear and squared BFE terms, associated p-values, R² values, and 95 % confidence intervals) together with cell-level sample sizes and a description of exclusion criteria (cells with fewer than 10 buildings or incomplete income data). To improve accessibility we will revise the abstract to include concise quantitative statements, for example noting the significance of the quadratic term and the direction of the high-entropy effect. We will also add a short paragraph on robustness to alternative exclusion thresholds in the methods if it is not already explicit. revision: yes
Circularity Check
No circularity; empirical metrics aligned on independent tessellation
full rationale
The paper defines BFE via Shannon entropy on building footprints, computes Gini and Moran's I separately on income data, and aligns the two via a regular tessellation before reporting observed non-linear associations. No step fits a parameter to the target relationship and then renames the fit as a prediction, no self-citation supplies a uniqueness theorem or ansatz that the central claim rests upon, and the derivation does not reduce any reported association to its own inputs by construction. The analysis remains an open empirical comparison against external spatial data.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Shannon entropy computed from building footprints yields a meaningful regime of urban form that can be compared across space.
- domain assumption Gini index and Moran's I on income data adequately represent distribution and local clustering for the purpose of detecting segregation.
Reference graph
Works this paper leans on
-
[1]
Burgess, E. W. Residential Segregation in American Cities.The ANNALS of the American Academy of Political and Social Science140, 105–115 (1928). URLhttps://journals.sagepub.com/doi/10. 1177/000271622814000115
work page 1928
-
[2]
Wong, D. W. Formulating a General Spatial Segregation Measure.The Professional Geographer57, 285– 294 (2005). URLhttp://www.tandfonline.com/doi/abs/10.1111/j.0033-0124.2005.00478.x
-
[3]
Netto, V., Krenz, K., Fiszon, M., Peres, O. & Rosalino, D. Decoding segregation: Navigating a century of seg- regation research across disciplines and introducing a bottom-up ontology.arXiv preprint arXiv:2410.08374 (2024)
- [4]
-
[5]
Balch, E. G. Hull house maps and papers (1895)
- [6]
-
[7]
James, D. R. & Taeuber, K. E. Measures of segregation.Sociological methodology15, 1–32 (1985)
work page 1985
-
[8]
Theil, H. & Finizza, A. J. A note on the measurement of racial integration of schools by means of informa- tional concepts (1971)
work page 1971
-
[9]
Reardon, S. F . & O’Sullivan, D. Measures of Spatial Segregation.Sociological Methodology34, 121–162 (2004). URLhttps://journals.sagepub.com/doi/10.1111/j.0081-1750.2004.00150.x
-
[10]
On the measure of geographic segregation
Morrill, R. On the measure of geographic segregation. InGeography research forum, vol. 11, 25–36 (1991)
work page 1991
-
[11]
Wong, D. W. S. Spatial Indices of Segregation.Urban Studies30, 559–572 (1993). URLhttps:// journals.sagepub.com/doi/10.1080/00420989320080551
-
[12]
Vaughan, L. The relationship between physical segregation and social marginalisation in the urban envi- ronment.World Architecture185, 88–96 (2005)
work page 2005
-
[13]
The spatial syntax of urban segregation.Progress in Planning67, 199–294 (2007)
Vaughan, L. The spatial syntax of urban segregation.Progress in Planning67, 199–294 (2007)
work page 2007
-
[14]
Saboya, R. T. d. & Peres, O. M. Chasm: A configurational measure of socio-spatial residential segregation. Environment and Planning B: Urban Analytics and City Science52, 1464–1481 (2025). 21
work page 2025
-
[15]
Brigatti, E., Netto, V. M., De Sousa Filho, F . N. M. & Cacholas, C. Entropy and hierarchical clustering: Characterizing the morphology of the urban fabric in different spatial cultures.Chaos: An Interdisciplinary Journal of Nonlinear Science31, 113138 (2021). URLhttps://pubs.aip.org/cha/article/31/11/ 113138/342078/Entropy-and-hierarchical-clustering-Cha...
work page 2021
-
[16]
Netto, V. M., Brigatti, E. & Cacholas, C. From urban form to information: Cellular configurations in different spatial cultures.Environment and Planning B: Urban Analytics and City Science50, 146–161 (2023). URL https://journals.sagepub.com/doi/10.1177/23998083221107382
-
[17]
Hillier, B., Penn, A., Hanson, J., Grajewski, T. & Xu, J. Natural movement: or, configuration and attraction in urban pedestrian movement.Environment and Planning B: planning and design20, 29–66 (1993)
work page 1993
-
[18]
Roberto, E. & Korver-Glenn, E. The spatial structure and local experience of residential segregation.Spatial Demography9, 277–307 (2021)
work page 2021
-
[19]
& Hanson, J.The social logic of space(Cambridge university press, 1984)
Hillier, B. & Hanson, J.The social logic of space(Cambridge university press, 1984)
work page 1984
-
[20]
Toms, J. D. & Lesperance, M. L. Piecewise regression: a tool for identifying ecological thresholds.Ecology 84, 2034–2041 (2003)
work page 2034
- [21]
- [22]
-
[23]
Catney, G. The complex geographies of ethnic residential segregation: Using spatial and local measures to explore scale-dependency and spatial relationships.Transactions of the Institute of British Geographers 43, 137–152 (2018)
work page 2018
-
[24]
Alonso, W.Location and land use: Toward a general theory of land rent(Harvard university press, 1964)
work page 1964
-
[25]
Brueckner, J. K., Thisse, J.-F . & Zenou, Y . Why is central paris rich and downtown detroit poor?: An amenity-based theory.European economic review43, 91–107 (1999)
work page 1999
-
[26]
Guerrieri, V., Hartley, D. & Hurst, E. Endogenous gentrification and housing price dynamics.Journal of Public Economics100, 45–60 (2013)
work page 2013
- [27]
-
[28]
Schelling, T. C. Dynamic models of segregation.Journal of mathematical sociology1, 143–186 (1971)
work page 1971
-
[29]
Roberto, E. The spatial proximity and connectivity method for measuring and analyzing residential segre- gation.Sociological Methodology48, 182–224 (2018)
work page 2018
-
[30]
Shannon, C. E. A mathematical theory of communication.Bell system technical journal27, 379–423 (1948)
work page 1948
-
[31]
Shannon, C. E. Prediction and entropy of printed english.Bell system technical journal30, 50–64 (1951)
work page 1951
-
[32]
Schürmann, T. & Grassberger, P . Entropy estimation of symbol sequences.Chaos: An Interdisciplinary Journal of Nonlinear Science6, 414–427 (1996)
work page 1996
-
[33]
Cover, T. M. & Thomas, J. A. Elements of information theory, 1991 john wiley & sons.Inc. Print ISBN 0-471-06259-6 Online ISBN 0-471-20061-1(1991)
work page 1991
-
[34]
universal and accessible entropy estimation using a com- pression algorithm
Brigatti, E. & de Sousa Filho, F . Comment on “universal and accessible entropy estimation using a com- pression algorithm”.Physical Review Letters129, 029801 (2022)
work page 2022
-
[35]
M.et al.Cities, from information to interaction.Entropy20, 834 (2018)
Netto, V. M.et al.Cities, from information to interaction.Entropy20, 834 (2018)
work page 2018
-
[36]
de Sousa Filho, F ., Pereira de Sá, V. & Brigatti, E. Entropy estimation in bidimensional sequences.Physical Review E105, 054116 (2022)
work page 2022
- [37]
-
[38]
de Geografia e Estatística, I. B.Censo demográfico 2010: Características da população e dos domicílios: Resultados do universo(IBGE, Rio de Janeiro, 2010)
work page 2010
-
[39]
URLhttps://cran.r-project.org/ package=DescTools
Signorell, A.DescTools: Tools for Descriptive Statistics(2025). URLhttps://cran.r-project.org/ package=DescTools. R package version 0.99.59.7
work page 2025
-
[40]
Frank, A. I. Using measures of spatial autocorrelation to describe socio-economic and racial residential patterns in us urban areas.Socio-economic applications of geographic information science (Innovations in GIS). London: Taylor & Francis147–162 (2003)
work page 2003
-
[41]
Bivand, R. S. & Wong, D. W. S. Comparing implementations of global and local indicators of spatial asso- ciation.Test27, 716–748 (2018)
work page 2018
-
[42]
& Piras, G.spdep: Spatial Dependence: Weighting Schemes, Statistics and Models (2024)
Bivand, R., Krainski, E. & Piras, G.spdep: Spatial Dependence: Weighting Schemes, Statistics and Models (2024). URLhttps://cran.r-project.org. R package version 1.3-3
work page 2024
-
[43]
Prener, C. Areal-weighted interpolation. areal R package vignette (2022). URLhttps://cran. 22 r-project.org/web/packages/areal/vignettes/areal-weighted-interpolation.html. 23
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.