A Latent Factor Panel Approach to Spatiotemporal Causal Inference
Pith reviewed 2026-05-18 16:20 UTC · model grok-4.3
The pith
Latent factor models partially identify causal effects in spatiotemporal data with interference
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Under a factor confounding assumption the effects of unmeasured confounders on exposures and outcomes are captured by a shared latent factor model. This assumption alone is sufficient to partially identify causal effects even when units interfere with one another. Additional assumptions that limit the degree of spatiotemporal interference are then sufficient to point-identify the effects.
What carries the argument
The factor confounding assumption that models unmeasured confounding through shared latent factors in a panel-data framework for spatiotemporal settings.
If this is right
- The method substantially reduces omitted-variable bias relative to spatial-smoothing and standard panel-data baselines in simulation studies.
- The approach can be applied to estimate the effect of prenatal PM2.5 exposure on birth weight using California data.
- Partial identification of causal effects holds without requiring the assumption of no interference between units.
- Point identification follows once the degree of spatiotemporal interference is limited by additional assumptions reasonable in most applications.
Where Pith is reading between the lines
- The same latent-factor adjustment could be tested in other observational domains that feature both geographic structure and potential interference, such as economic or social-network panels.
- Sensitivity checks that vary the number of latent factors or examine residual spatial correlation would help assess how strongly results depend on the factor model.
- Connections to multivariate causal-inference techniques for multiple outcomes or treatments may offer further ways to strengthen identification.
Load-bearing premise
That the influence of unmeasured confounders on both exposures and outcomes can be represented by a shared latent factor model.
What would settle it
A setting in which the true confounding structure cannot be captured by low-rank latent factors, yet the method still produces estimates that match those from a randomized experiment or other gold-standard design in the same data.
Figures
read the original abstract
Unmeasured confounding can severely bias causal effect estimates from spatiotemporal observational data, especially when the confounders do not vary smoothly in time and space. In this work, we develop a method for addressing unmeasured confounding in spatiotemporal contexts by building on models from the panel data literature and methods in multivariate causal inference. Our method is based on a factor confounding assumption, which posits that effects of unmeasured confounders on exposures and outcomes can be captured by a shared latent factor model. Factor confounding is sufficient to partially identify causal effects, even when there is interference between units. Additional assumptions that limit the degree of spatiotemporal interference, reasonable in most applications, are sufficient to point identify the effects. Simulation studies demonstrate that the proposed approach can substantially reduce omitted variable bias relative to other spatial smoothing and panel data baselines. We illustrate our method in a case study of the effect of prenatal PM2.5 exposure on birth weight in California.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a latent factor panel model to address unmeasured confounding in spatiotemporal causal inference. It assumes that unmeasured confounders affect both exposures and outcomes through a shared low-dimensional latent factor structure. This assumption is claimed to allow partial identification of causal effects even under unit interference, with further assumptions on limited interference enabling point identification. The approach is evaluated through simulations demonstrating bias reduction relative to spatial smoothing and panel baselines, and applied to estimate the effect of prenatal PM2.5 exposure on birth weight using California data.
Significance. Should the identification strategy prove robust, the method could advance causal analysis in fields like environmental epidemiology and spatial statistics by handling complex confounding and interference without requiring strong smoothness assumptions on confounders. The simulation results and case study provide initial evidence of practical performance, though generalizability hinges on the factor confounding premise holding in real applications.
major comments (1)
- [Identification argument (abstract and §3)] The central claim that factor confounding suffices for partial identification under interference requires explicit verification that cross-unit confounding paths introduced by the interference kernel remain within the span of the shared latent factors. If the potential outcome model includes a low-rank factor term plus an interference kernel, it is not immediate that the observed data moments continue to bound the target parameter when the kernel is nonzero; this step appears to be the least secure and should be elaborated with a formal derivation or counterexample check.
minor comments (2)
- [Simulation studies] The simulation results should report standard errors or confidence intervals around bias estimates to allow assessment of the magnitude of improvement over baselines.
- [Abstract] Clarify the precise additional assumptions that limit the degree of spatiotemporal interference to achieve point identification, as these are described only at a high level in the abstract.
Simulated Author's Rebuttal
We thank the referee for their careful and constructive review of our manuscript. The major comment raises an important point about the rigor of the identification argument under interference, and we address it directly below with a plan for revision.
read point-by-point responses
-
Referee: [Identification argument (abstract and §3)] The central claim that factor confounding suffices for partial identification under interference requires explicit verification that cross-unit confounding paths introduced by the interference kernel remain within the span of the shared latent factors. If the potential outcome model includes a low-rank factor term plus an interference kernel, it is not immediate that the observed data moments continue to bound the target parameter when the kernel is nonzero; this step appears to be the least secure and should be elaborated with a formal derivation or counterexample check.
Authors: We agree that the current exposition of the identification result under nonzero interference would benefit from greater explicitness. In the revised manuscript we will expand Section 3 with a formal derivation that (i) writes the potential-outcome equation as the sum of the low-rank factor component and the interference kernel, (ii) shows that any additional cross-unit confounding paths generated by the kernel remain in the column space of the shared latent factors, and (iii) verifies that the observed-data moments therefore continue to deliver the same partial-identification bounds on the target causal parameter. We will also include a short discussion of the boundary case in which the kernel introduces confounding outside the factor span, confirming that our maintained assumptions rule this out. These additions will be placed immediately after the statement of the main identification theorem. revision: yes
Circularity Check
No significant circularity; derivation rests on explicit modeling assumptions with external validation
full rationale
The paper states the factor confounding assumption as an explicit premise in the abstract and methods, then derives partial identification results from it under additional interference-limiting assumptions. Simulations and the California birth-weight case study serve as external checks rather than internal reductions. No quoted equations show a target causal effect being recovered by construction from a fitted parameter, nor does any load-bearing step collapse to a self-citation chain or renamed ansatz. The central claim therefore remains independent of its own fitted outputs.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Factor confounding assumption: effects of unmeasured confounders on exposures and outcomes can be captured by a shared latent factor model
- domain assumption Limited spatiotemporal interference assumptions sufficient for point identification
invented entities (1)
-
shared latent factors
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Our method is based on a factor confounding assumption, which posits that effects of unmeasured confounders on exposures and outcomes can be captured by a shared latent factor model.
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Under model (2) and Assumptions 5-7, the causal effect functions gi((Djt)j∈Ni) are identified for all i.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Anderson, T. W. and H. Rubin (1956). Statistical inference in factor analysis. In Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 5: Contributions to Econometrics, Industrial Research, and Psychometry , Volume 3.5, pp.\ 111--150. University of California Press
work page 1956
-
[2]
Arkhangelsky, D. and G. Imbens (2024). Causal models for longitudinal and panel data: A survey. The Econometrics Journal\/ 27\/ (3), C1--C61
work page 2024
-
[3]
Aronow, P. M. and C. Samii (2017). Estimating average causal effects under general interference, with application to a social network experiment. Annals of Applied Statistics\/ 11\/ (4), 1912--1947
work page 2017
-
[4]
Bai, J. (2009). Panel data models with interactive fixed effects. Econometrica\/ 77\/ (4), 1229--1279
work page 2009
-
[5]
Bai, J. and S. Ng (2002). Determining the number of factors in approximate factor models. Econometrica\/ 70\/ (1), 191--221
work page 2002
-
[6]
Ben-Michael, E., A. Feller, and E. A. Stuart (2021). A trial emulation approach for policy evaluations with group-level longitudinal data. Epidemiology\/ 32\/ (4), 533--540
work page 2021
-
[7]
Bind, M.-A. (2019). Causal modeling in environmental health. Annual review of public health\/ 40\/ (1), 23--43
work page 2019
-
[8]
Bobb, J. F., M. F. Cruz, S. J. Mooney, A. Drewnowski, D. Arterburn, and A. J. Cook (2022). Accounting for spatial confounding in epidemiological studies with individual-level exposures: An exposure-penalized spline approach. Journal of the Royal Statistical Society Series A: Statistics in Society\/ 185\/ (3), 1271--1293
work page 2022
-
[9]
Chernozhukov, V., D. Chetverikov, M. Demirer, E. Duflo, C. Hansen, W. Newey, and J. Robins (2018). Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal\/ 21\/ (1), C1--C68
work page 2018
-
[10]
Cinelli, C. and C. Hazlett (2020). Making sense of sensitivity: Extending omitted variable bias. Journal of the Royal Statistical Society Series B-Statistical Methodology\/ 82\/ (1), 39--67
work page 2020
-
[11]
Diez Roux, A. V. (2001). Investigating neighborhood and area effects on health. American journal of public health\/ 91\/ (11), 1783--1789
work page 2001
-
[12]
Dupont, E., S. N. Wood, and N. H. Augustin (2022). Spatial+: a novel approach to spatial confounding. Biometrics\/ 78\/ (4), 1279--1290
work page 2022
-
[13]
Efron, B. (2009). Are a set of microarrays independent of each other? The annals of applied statistics\/ 3\/ (3), 922
work page 2009
-
[14]
Ghosh, R., K. Causey, K. Burkart, S. Wozniak, A. Cohen, and M. Brauer (2021). Ambient and household pm2. 5 pollution and adverse perinatal outcomes: a meta-regression and analysis of attributable global burden for 204 countries and territories. PLoS medicine\/ 18\/ (9), e1003718
work page 2021
-
[15]
Gilbert, B., A. Datta, and E. Ogburn (2021). A causal inference framework for spatial confounding. arXiv preprint arXiv:2112.14946\/
-
[16]
Gong, C., J. Wang, Z. Bai, D. Q. Rich, and Y. Zhang (2022). Maternal exposure to ambient pm2. 5 and term birth weight: a systematic review and meta-analysis of effect estimates. Science of The Total Environment\/ 807 , 150744
work page 2022
-
[17]
Guan, Y., G. L. Page, B. J. Reich, M. Ventrucci, and S. Yang (2023). Spectral adjustment for spatial confounding. Biometrika\/ 110\/ (3), 699--719
work page 2023
-
[18]
Hern \'a n, M. A. and J. M. Robins (2020). Causal Inference: What If . Boca Raton: Chapman & Hall/CRC
work page 2020
-
[19]
Hodges, J. S. and B. J. Reich (2010). Adding spatially-correlated errors can mess up the fixed effect you love. The American Statistician\/ 64\/ (4), 325--334
work page 2010
-
[20]
Hoff, P. D. (2016). Limitations on detecting row covariance in the presence of column covariance. Journal of Multivariate Analysis\/ 152 , 249--258
work page 2016
- [21]
-
[22]
Keller, J. P. and A. A. Szpiro (2020). Selecting a scale for spatial confounding adjustment. Journal of the Royal Statistical Society Series A: Statistics in Society\/ 183\/ (3), 1121--1143
work page 2020
- [23]
-
[24]
Lumley, T. and L. Sheppard (2000). Assessing seasonal confounding and model selection bias in air pollution epidemiology using positive and negative control analyses. Environmetrics: The official journal of the International Environmetrics Society\/ 11\/ (6), 705--717
work page 2000
-
[25]
Manson, S., J. Schroeder, D. Van Riper, K. Knowles, T. Kugler, F. Roberts, and S. Ruggles (2024). Ipums national historical geographic information system: Version 19.0. Dataset
work page 2024
-
[26]
Miao, W., W. Hu, E. L. Ogburn, and X.-H. Zhou (2022). Identifying effects of multiple treatments in the presence of unmeasured confounding. Journal of the American Statistical Association\/ , 1--15
work page 2022
-
[27]
Nyadanu, S. D., J. Dunne, G. A. Tessema, B. Mullins, B. Kumi-Boateng, M. L. Bell, B. Duko, and G. Pereira (2022). Prenatal exposure to ambient air pollution and adverse birth outcomes: an umbrella review of 36 systematic reviews and meta-analyses. Environmental pollution\/ 306 , 119465
work page 2022
-
[28]
Paciorek, C. J. (2010). The importance of scale for spatial-confounding bias and precision of spatial regression estimators. Statistical Science\/ 25\/ (1), 107
work page 2010
-
[29]
Papadogeorgou, G., C. Choirat, and C. M. Zigler (2019). Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching. Biostatistics\/ 20\/ (2), 256--272
work page 2019
- [30]
- [31]
-
[32]
Reich, B. J., S. Yang, Y. Guan, A. B. Giffin, M. J. Miller, and A. Rappold (2021). A review of spatial causal inference methods for environmental and epidemiological applications. International Statistical Review\/ 89\/ (3), 605--634
work page 2021
-
[33]
Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology\/ 66\/ (5), 688
work page 1974
-
[34]
Schnell, P. M. and G. Papadogeorgou (2020). Mitigating unobserved spatial confounding when estimating the effect of supermarket access on cardiovascular disease deaths. The Annals of Applied Statistics\/ 14\/ (4), 2069--2095
work page 2020
-
[35]
Shen, S., C. Li, A. Van Donkelaar, N. Jacobs, C. Wang, and R. V. Martin (2024). Enhancing global estimation of fine particulate matter concentrations by including geophysical a priori information in deep learning. ACS ES&T Air\/ 1\/ (5), 332--345
work page 2024
-
[36]
Shi, X., W. Miao, and E. T. Tchetgen (2020). A selective review of negative control methods in epidemiology. Current epidemiology reports\/ 7 , 190--202
work page 2020
-
[37]
Stan modeling language users guide and reference manual, version 2.32
Stan Development Team (2023). Stan modeling language users guide and reference manual, version 2.32
work page 2023
-
[38]
California Vital Data (Cal-ViDa), Birth Query
State of California, Department of Public Health (2025). California Vital Data (Cal-ViDa), Birth Query . https://cal-vida.cdph.ca.gov/. Last modified Feb 1, 2025
work page 2025
-
[39]
Tchetgen Tchetgen, E. J., A. Ying, Y. Cui, X. Shi, and W. Miao (2024). An introduction to proximal causal inference. Statistical Science\/ 39\/ (3), 375--390
work page 2024
-
[40]
Thaden, H. and T. Kneib (2018). Structural equation models for dealing with spatial confounding. The American Statistician\/ 72\/ (3), 239--252
work page 2018
-
[41]
Tipping, M. E. and C. M. Bishop (1999). Probabilistic principal component analysis. Journal of the Royal Statistical Society Series B: Statistical Methodology\/ 61\/ (3), 611--622
work page 1999
-
[42]
Uwak, I., N. Olson, A. Fuentes, M. Moriarty, J. Pulczinski, J. Lam, X. Xu, B. D. Taylor, S. Taiwo, K. Koehler, et al. (2021). Application of the navigation guide systematic review methodology to evaluate prenatal exposure to particulate matter air pollution and infant birth weight. Environment international\/ 148 , 106378
work page 2021
-
[43]
Vehtari, A., A. Gelman, and J. Gabry (2017). Practical bayesian model evaluation using leave-one-out cross-validation and waic. Statistics and computing\/ 27 , 1413--1432
work page 2017
-
[44]
Zheng, J., A. D'Amour, and A. Franks (2025). Copula-based sensitivity analysis for multi-treatment causal inference with unobserved confounding. Journal of Machine Learning Research\/ 26\/ (36), 1--60
work page 2025
-
[45]
Zheng, J., A. D’Amour, and A. Franks (2022). Bayesian inference and partial identification in multi-treatment causal inference with unobserved confounding. In International Conference on Artificial Intelligence and Statistics , pp.\ 3608--3626. PMLR
work page 2022
-
[46]
Zheng, J., J. Wu, A. D’Amour, and A. Franks (2024). Sensitivity to unobserved confounding in studies with factor-structured outcomes. Journal of the American Statistical Association\/ 119\/ (547), 2026--2037
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.