{"paper":{"title":"A Joint Synthetic Housing-Household Inventory","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"A framework generates synthetic data pairing specific housing units with compatible households while matching real block-group demographics.","cross_cats":[],"primary_cat":"cs.CY","authors_text":"Rachel Davidson, Shangjia Dong, Xiao Qian","submitted_at":"2026-05-16T14:58:44Z","abstract_excerpt":"Accurately understanding the interactions between humans and the built environment requires integrated representations of both the buildings and the populations that occupy them. However, high-fidelity datasets that jointly capture detailed housing structures and demographic characteristics at the household level do not currently exist. This paper presents a framework for constructing a joint housing-household inventory that explicitly links individuals and households to compatible housing units from the National Structure Inventory (NSI), while preserving realistic population densities and de"},"claims":{"count":4,"items":[{"kind":"strongest_claim","text":"The generated joint inventory matches block-group-level demographic distributions, reproduces observed spatial population patterns without systematic bias, and maintains consistent allocation quality across urban, suburban, and rural contexts.","source":"verdict.strongest_claim","status":"machine_extracted","claim_id":"C1","attestation":"unclaimed"},{"kind":"weakest_assumption","text":"The deep contrastive learning model accurately quantifies true housing-household compatibility in a way that, when fed into the hierarchical optimization, produces allocations that preserve realistic joint distributions rather than merely satisfying aggregate constraints.","source":"verdict.weakest_assumption","status":"machine_extracted","claim_id":"C2","attestation":"unclaimed"},{"kind":"one_line_summary","text":"A framework integrates synthetic population generation from ACS PUMS, deep contrastive learning for housing-household compatibility, and hierarchical optimization to produce a joint inventory that matches block-group demographics and spatial patterns in coastal North Carolina.","source":"verdict.one_line_summary","status":"machine_extracted","claim_id":"C3","attestation":"unclaimed"},{"kind":"headline","text":"A framework generates synthetic data pairing specific housing units with compatible households while matching real block-group demographics.","source":"verdict.pith_extraction.headline","status":"machine_extracted","claim_id":"C4","attestation":"unclaimed"}],"snapshot_sha256":"5b7511d6b21e49322147dc37f8aecb07239d502b5d9370011d05a1b984628a88"},"source":{"id":"2605.17031","kind":"arxiv","version":1},"verdict":{"id":"0b0e0eed-9be2-4990-83fe-7a63706a14a3","model_set":{"reader":"grok-4.3"},"created_at":"2026-05-19T18:49:20.748369Z","strongest_claim":"The generated joint inventory matches block-group-level demographic distributions, reproduces observed spatial population patterns without systematic bias, and maintains consistent allocation quality across urban, suburban, and rural contexts.","one_line_summary":"A framework integrates synthetic population generation from ACS PUMS, deep contrastive learning for housing-household compatibility, and hierarchical optimization to produce a joint inventory that matches block-group demographics and spatial patterns in coastal North Carolina.","pipeline_version":"pith-pipeline@v0.9.0","weakest_assumption":"The deep contrastive learning model accurately quantifies true housing-household compatibility in a way that, when fed into the hierarchical optimization, produces allocations that preserve realistic joint distributions rather than merely satisfying aggregate constraints.","pith_extraction_headline":"A framework generates synthetic data pairing specific housing units with compatible households while matching real block-group demographics."},"integrity":{"clean":true,"summary":{"advisory":0,"critical":0,"by_detector":{},"informational":0},"endpoint":"/pith/2605.17031/integrity.json","findings":[],"available":true,"detectors_run":[{"name":"citation_quote_validity","ran_at":"2026-05-19T19:49:43.774408Z","status":"completed","version":"0.1.0","findings_count":0},{"name":"doi_title_agreement","ran_at":"2026-05-19T19:01:18.794667Z","status":"completed","version":"1.0.0","findings_count":0},{"name":"doi_compliance","ran_at":"2026-05-19T19:00:41.329026Z","status":"completed","version":"1.0.0","findings_count":0},{"name":"cited_work_retraction","ran_at":"2026-05-19T18:51:57.128565Z","status":"completed","version":"1.0.0","findings_count":0},{"name":"claim_evidence","ran_at":"2026-05-19T18:41:56.172552Z","status":"completed","version":"1.0.0","findings_count":0},{"name":"ai_meta_artifact","ran_at":"2026-05-19T18:33:23.489066Z","status":"skipped","version":"1.0.0","findings_count":0}],"snapshot_sha256":"54ddfbc77687dee99525092915d9927bc48c3f156ab91b9ece4ebe5064d15ed6"},"references":{"count":41,"sample":[{"doi":"","year":2013,"title":"Transportation Research Part B: Methodological , volume=","work_id":"358dda0b-3a34-4c84-b6d5-6b7870be7fa3","ref_index":1,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2023,"title":"International Conference on Machine Learning , pages=","work_id":"fff2b0af-e305-4163-9688-c193c67d5704","ref_index":2,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2021,"title":"Sustainable and Resilient Infrastructure , volume=","work_id":"92cea87a-c8ca-41a4-ae33-e04285614e87","ref_index":3,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2018,"title":"Nature Climate Change , volume=","work_id":"4e71ef19-efe3-4e0d-be8a-a178764b2cd7","ref_index":4,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":null,"title":"Advances in neural information processing systems , volume=","work_id":"b459211e-36d2-4b63-99b0-c4dd6f5b840c","ref_index":5,"cited_arxiv_id":"","is_internal_anchor":false}],"resolved_work":41,"snapshot_sha256":"b2b4482c38a593e2b09bb893025aa40d3f841098b6424339dd4869fe953997f8","internal_anchors":0},"formal_canon":{"evidence_count":2,"snapshot_sha256":"21d0172d6f6d1ff8d40be42df83b9ec777e3c7c61d0685cee6cf33b8398442ce"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"}