Beyond Land Surface Temperature: Explainable Spatial Machine Learning Reveals Urban Morphology Effects on Human-Centric Heat Stress
Pith reviewed 2026-05-08 12:21 UTC · model grok-4.3
The pith
Sky view factor drives most variation in human heat stress but contributes little to land surface temperature, showing LST misses key shading effects.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Using a Modeling-Comparing-Assessing framework with Landsat LST and GPU-accelerated 1-m UTCI in Singapore, the study finds notable spatial discrepancies between the two thermal metrics and substantial non-stationary effects from urban morphology factors. Spatially explicit SHAP values show sky view factor as central to UTCI variability yet marginal for LST, while SHAP-GAM links higher albedo to increased UTCI. These results demonstrate that LST inadequately captures the shading-driven and radiative processes that govern human heat stress.
What carries the argument
Geographically weighted XGBoost (GW-XGBoost) combined with spatially explicit SHAP explanations applied to 2D and 3D urban morphology factors to compare LST and UTCI.
If this is right
- Urban planners should incorporate physiologically relevant indices like UTCI rather than relying primarily on LST for heat-risk assessment.
- Higher surface albedo may increase human heat stress despite cooling surface temperatures.
- 3D urban features such as sky view factor require location-specific consideration because their effects on heat stress vary spatially.
- Targeted interventions like improved shading can reduce human heat stress even where they have limited impact on satellite-measured LST.
Where Pith is reading between the lines
- Satellite LST maps used for city-wide heat planning may systematically understate the benefits of adding shade structures in dense neighborhoods.
- The approach could be extended to other cities by testing whether similar sky-view discrepancies appear when comparing LST and UTCI at matching resolutions.
- Integration with pedestrian-level wind and humidity sensors would help isolate whether the sky view factor effect on UTCI is truly independent of airflow.
- This distinction matters for equity, as areas with low sky view factor often coincide with dense, lower-income districts where heat exposure affects vulnerable populations most.
Load-bearing premise
The 1-meter UTCI model and chosen urban morphology variables fully capture human heat stress without major missing influences from wind, humidity, activity, or other confounders, and the identified relationships reflect causal mechanisms rather than mere correlations.
What would settle it
Direct field measurements of pedestrian thermal comfort or physiological responses across sites with differing sky view factors that align more closely with LST patterns than UTCI patterns, or that remove the sky view factor effect once local wind is accounted for.
Figures
read the original abstract
Heat exposure connects the built environment and public health, directly shaping the livability and sustainability of urban areas. Understanding the spatial heterogeneity of heat exposure and its drivers is vital for climate-adaptive urban planning. However, most planning-oriented studies rely on land surface temperature (LST), and whether LST adequately represents human heat exposure and how it differs from physiologically relevant heat stress remains insufficiently examined. Here, adopting Landsat-retrieved 30-m LST and GPU-accelerated 1-m universal thermal climate index (UTCI) in Singapore, this study establishes a comprehensive "Modeling-Comparing-Assessing" framework to systematically evaluate the spatial and mechanistic discrepancies between the two metrics. We further investigate pronounced non-stationary and threshold-based quantitative relationships of the two metrics with urban factors by employing a novel geographically weighted XGBoost (GW-XGBoost) and generalized additive model (GAM) workflow. Our results demonstrate notable discrepancies in spatial patterns of LST and UTCI, along with substantial spatial heterogeneity in how 2D and 3D urban factors impact these two thermal metrics, as revealed by explainable GW-XGBoost models (global out-of-bag R2 = 0.855 for LST and 0.905 for UTCI, respectively). Crucially, spatially explicit SHAP interprets that sky view factor plays a central role in explaining UTCI variability but exhibits a comparatively marginal independent contribution to LST, indicating that LST inadequately captures shading-driven and radiative processes governing actual human heat stress. Notably, SHAP-GAM analysis indicates that higher albedo is associated with increased UTCI. These novel findings provide evidence for integrating physiologically relevant thermal indices to inform targeted heat risk management and climate-adaptive urban planning.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a 'Modeling-Comparing-Assessing' framework using 30-m Landsat LST and 1-m GPU-accelerated UTCI data for Singapore to compare land surface temperature with human-centric heat stress metrics. It employs geographically weighted XGBoost (GW-XGBoost) models with SHAP explanations and generalized additive models (GAM) to reveal spatial heterogeneity and non-stationary relationships with 2D/3D urban morphology factors, claiming that sky view factor (SVF) has a central role in UTCI but marginal contribution to LST, thus indicating LST inadequately captures shading and radiative processes relevant to human heat stress. Global out-of-bag R² values are reported as 0.855 for LST and 0.905 for UTCI.
Significance. If the central claim regarding the differential role of SVF holds after addressing potential resolution artifacts, this work would be significant for urban climate research and planning, as it provides evidence-based arguments for shifting from LST to physiologically relevant indices like UTCI in heat risk assessment, supported by high predictive performance and explainable spatial ML techniques. The integration of high-resolution modeling and interpretable ML is a strength.
major comments (2)
- [Results (SHAP interpretations)] The claim that sky view factor (SVF) has a central role in explaining UTCI variability but marginal independent contribution to LST (as per abstract and SHAP-GAM analysis) may be driven by the resolution mismatch between the 30-m LST and 1-m UTCI data. Since LST spatially averages fine-scale shading effects that SVF parameterizes at 1 m, the lower SHAP contribution to LST could be an aggregation artifact rather than evidence of LST's inadequacy in capturing shading-driven processes. An aggregation test or resolution-matched comparison is required to support this load-bearing interpretation.
- [Methods (model training and validation)] Details on cross-validation strategy, handling of spatial autocorrelation, error propagation, and sensitivity to data exclusions are missing despite the spatially explicit nature of the GW-XGBoost models. These are necessary to verify the reported out-of-bag R² values (0.855 for LST, 0.905 for UTCI) and the robustness of the non-stationary relationships identified.
minor comments (2)
- [Abstract] The abstract mentions 'notable discrepancies in spatial patterns' but does not quantify them; consider adding specific metrics or references to figures.
- [Discussion] The finding that higher albedo is associated with increased UTCI should be discussed in context of potential confounding factors such as material properties or measurement biases.
Simulated Author's Rebuttal
We thank the referee for their thoughtful and constructive review, which highlights important considerations for strengthening the interpretation of our results and the transparency of our methods. We address each major comment below and outline the revisions we will make to the manuscript.
read point-by-point responses
-
Referee: [Results (SHAP interpretations)] The claim that sky view factor (SVF) has a central role in explaining UTCI variability but marginal independent contribution to LST (as per abstract and SHAP-GAM analysis) may be driven by the resolution mismatch between the 30-m LST and 1-m UTCI data. Since LST spatially averages fine-scale shading effects that SVF parameterizes at 1 m, the lower SHAP contribution to LST could be an aggregation artifact rather than evidence of LST's inadequacy in capturing shading-driven processes. An aggregation test or resolution-matched comparison is required to support this load-bearing interpretation.
Authors: We agree that the resolution difference between the 30-m LST and 1-m UTCI datasets is a relevant factor that merits explicit examination to rule out aggregation artifacts. Our core claim is that LST, as commonly available at 30 m, does not resolve the fine-scale shading and radiative processes captured by 1-m UTCI, which is why SVF shows lower independent contribution to LST. This is physically expected rather than purely artifactual, but we acknowledge that a direct test would provide stronger evidence. In the revised manuscript we will add an aggregation analysis: we will upscale the 1-m UTCI to 30-m resolution, retrain the GW-XGBoost model on the aggregated data, and compare the resulting SHAP values for SVF against the original 1-m results. This will be presented in a new supplementary figure and discussed in the Results section to confirm that the differential role of SVF persists beyond simple aggregation effects. revision: yes
-
Referee: [Methods (model training and validation)] Details on cross-validation strategy, handling of spatial autocorrelation, error propagation, and sensitivity to data exclusions are missing despite the spatially explicit nature of the GW-XGBoost models. These are necessary to verify the reported out-of-bag R² values (0.855 for LST, 0.905 for UTCI) and the robustness of the non-stationary relationships identified.
Authors: We appreciate this request for greater methodological transparency. The current manuscript reports global out-of-bag R² values from the GW-XGBoost models but does not fully detail the spatial cross-validation procedure, autocorrelation handling, or sensitivity checks. In the revised version we will expand the Methods section to include: (1) a description of the spatial blocking cross-validation strategy used to mitigate autocorrelation (with block sizes chosen based on variogram analysis); (2) explicit discussion of how input data uncertainties (e.g., from Landsat LST retrieval and UTCI modeling) propagate through the models; and (3) results from sensitivity tests that systematically exclude subsets of data (e.g., by land-use type or spatial region) to assess stability of the reported R² values and the identified non-stationary relationships. These additions will be supported by new supplementary tables and text. revision: yes
Circularity Check
No significant circularity; data-driven ML comparison is self-contained
full rationale
The paper trains separate GW-XGBoost models on urban morphology predictors to explain 30 m LST and 1 m UTCI, then applies SHAP for interpretation. These are standard supervised fits on independent inputs (satellite retrievals and GPU-modeled index) with no equations that reduce outputs to inputs by construction, no self-citations invoked as uniqueness theorems, and no ansatz or renaming that collapses the claimed discrepancy. Global OOB R² values and SHAP rankings are direct model outputs, not tautological. Minor hyperparameter choices exist but do not load-bear the central LST-vs-UTCI mechanistic claim.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The 1-m resolution GPU-accelerated UTCI model accurately represents physiologically relevant human heat stress
- domain assumption Selected 2D and 3D urban morphology factors are the primary drivers of spatial variation in both thermal metrics
Reference graph
Works this paper leans on
-
[1]
Building and Environment 288, 113992
Improving WRF-BEP+BEM performance in simulating wind-temperature-humidity in high-density urban areas: A case study of a megacity. Building and Environment 288, 113992. doi:10.1016/j.enbuild.2025.113992. Huang, B., Wu, B., Barry, M., 2010. Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices. Internation...
-
[2]
Global urban population exposure to extreme heat. Proceedings of the National Academy of Sciences 118, e2024792118. U.S. Geological Survey, 2025. Landsat 8 mission — u.s. geological survey. URL:https://www.usgs. gov/landsat-missions/landsat-8. Wan, Y., Du, H., Yuan, L., Xu, X., Tang, H., Zhang, J., 2025. Exploring the influence of block environmental char...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.