arxiv: 2605.31489 · v1 · pith:V6T7OMKXnew · submitted 2026-05-29 · 💻 cs.CY

Context-Conditioned Generative Models Enable Subnational Refinement of Sparse Humanitarian Surveys

classification 💻 cs.CY
keywords data · generative · survey · distributions · humanitarian · models · scarcity · sparse

Data scarcity limits inference in many scientific and policy domains. Survey data are essential for decision-making, but sparse samples often fail to capture variation at fine spatial granularities. We evaluate normalizing flows, a class of generative models that learn complex data distributions and can be conditioned on exogenous contextual features, in controlled data-scarcity scenarios. Across eight household survey datasets from the humanitarian domain, spanning six low- or middle-income countries, we show that context-conditioned generative models can refine subnational survey distributions under severe data scarcity, and that performance increases systematically with the richness of the conditioning information. These findings support a general principle for survey data augmentation: generative models can improve subnational estimates when the sparse sample retains sufficient support and the contextual covariates encode relevant local heterogeneity. By learning full conditional distributions rather than point estimates, the approach provides fine-grained evidence for humanitarian decision-making and resource allocation.
