Forecasting Residential Heating and Electricity Demand with Scalable, High-Resolution, Open-Source Models
Pith reviewed 2026-05-19 12:25 UTC · model grok-4.3
The pith
Probabilistic models trained on gas-heated region data forecast building-level residential heating and non-heating electricity demand with substantially lower errors than ResStock.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Our probabilistic deep learning models, leveraging multimodal building information and weather data, achieve RMSE scores that are 18.8 percent lower for heating demand and 27.6 percent lower for electricity demand, along with 59 percent improvement in weighted interval score, compared to estimates based on the ResStock model in building-level validation tests.
What carries the argument
Probabilistic deep learning models that process multimodal building-level data including footprints, heights, nearby density, land use, and high-resolution weather to separate and forecast heating and non-heating demands.
If this is right
- Enables scalable high-resolution demand estimation across the residential sector.
- Supports policymakers with granular insights into demand heterogeneity.
- Facilitates more effective grid planning and decarbonization strategies for buildings.
- Provides an open-source platform that can be applied broadly for climate goals.
Where Pith is reading between the lines
- Applying the same separation logic in all-electric regions could reveal if heating electrification patterns differ from the training assumptions.
- The approach might combine with satellite imagery for even finer spatial resolution in future updates.
- Similar models could help forecast demand in other sectors like commercial buildings where data is sparse.
Load-bearing premise
The training data from a gas-heated region accurately capture non-heating electricity patterns without any leftover heating-related consumption affecting the separation.
What would settle it
A direct comparison of the model's predictions against measured heating and electricity demand in a region where electric heating is common would test whether the learned patterns truly isolate non-heating uses.
Figures
read the original abstract
We present a novel framework for high-resolution forecasting of residential heating demand and non-heating electricity demand using probabilistic deep learning models. Because our models are trained on electricity consumption from a predominantly gas-heated region, the learned electricity demand patterns primarily reflect non-heating end uses such as lighting, appliances, and cooling. We focus specifically on providing hourly building-level electricity and heating demand forecasts for the residential sector. Leveraging multimodal building-level information -- including data on building footprint areas, heights, nearby building density, nearby building size, land use patterns, and high-resolution weather data -- and probabilistic modeling, our methods provide granular insights into demand heterogeneity. Validation at the building level underscores a step change improvement in performance relative to NREL's ResStock model, which has emerged as a research community standard for residential heating and electricity demand characterization. In building-level heating and electricity estimation backtests, our probabilistic models respectively achieve RMSE scores 18.8% and 27.6% lower than those based on ResStock, with probabilistic forecast quality measured via WIS improving by 59% for both applications. By offering an open-source, scalable, high-resolution platform for demand estimation and forecasting, this research advances the tools available for policymakers and grid planners, contributing to the broader effort to decarbonize the U.S. building stock and meeting climate objectives.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a novel framework for high-resolution forecasting of residential heating demand and non-heating electricity demand using probabilistic deep learning models. Trained on electricity consumption from a predominantly gas-heated region, the models leverage multimodal building-level data (footprint areas, heights, density, land use) and high-resolution weather to generate hourly probabilistic forecasts. It claims a step-change improvement over NREL's ResStock benchmark, with building-level backtests showing 18.8% lower RMSE for heating, 27.6% lower RMSE for electricity, and 59% better WIS for both.
Significance. If the reported performance gains prove robust, the open-source and scalable nature of the platform would represent a meaningful advance over existing tools like ResStock for granular, probabilistic demand estimation. This could support more accurate policy analysis and grid planning for building decarbonization. The use of multimodal inputs and probabilistic modeling directly addresses demand heterogeneity, which is a noted limitation in coarser models.
major comments (2)
- Abstract: The abstract reports specific RMSE improvements (18.8% for heating, 27.6% for electricity) and WIS gains (59%) but supplies no information on model architecture, training procedure, data sources, cross-validation strategy, or statistical significance testing. This prevents assessment of whether the claimed gains are robust.
- Abstract: The central performance claims rest on the assumption that electricity consumption data from a 'predominantly gas-heated region' captures only non-heating end uses without residual heating-related electricity consumption. No verification, sensitivity analysis, or discussion of potential contamination (e.g., from electric resistance or heat pumps) is provided, which is load-bearing for the separation task and the direct comparison to ResStock.
minor comments (1)
- Abstract: The phrase 'multimodal building-level information' lists several variables but would benefit from explicit examples or a brief definition to improve immediate clarity for readers.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed comments. We address each major comment below, indicating planned revisions to the manuscript where appropriate.
read point-by-point responses
-
Referee: Abstract: The abstract reports specific RMSE improvements (18.8% for heating, 27.6% for electricity) and WIS gains (59%) but supplies no information on model architecture, training procedure, data sources, cross-validation strategy, or statistical significance testing. This prevents assessment of whether the claimed gains are robust.
Authors: The abstract is intentionally concise to highlight the core contribution and results. The full manuscript details the probabilistic deep learning models (including architecture), training procedures on electricity consumption data, multimodal building and weather data sources, cross-validation approach in the building-level backtests, and statistical comparisons to ResStock. To facilitate assessment without requiring readers to consult the full text immediately, we will revise the abstract to include a brief summary of the methodological framework and validation strategy. revision: yes
-
Referee: Abstract: The central performance claims rest on the assumption that electricity consumption data from a 'predominantly gas-heated region' captures only non-heating end uses without residual heating-related electricity consumption. No verification, sensitivity analysis, or discussion of potential contamination (e.g., from electric resistance or heat pumps) is provided, which is load-bearing for the separation task and the direct comparison to ResStock.
Authors: The manuscript explicitly selects a predominantly gas-heated region to isolate non-heating electricity end uses and discusses this choice as central to the separation of heating and non-heating demands. We agree that the abstract itself provides limited elaboration on verification or sensitivity to residual electric heating. In the revised version, we will expand the relevant sections to include additional discussion of potential contamination risks, regional heating characteristics, and any robustness checks supporting the assumption. revision: yes
Circularity Check
No circularity; claims rest on external benchmark comparison
full rationale
The provided abstract presents empirical backtest results (RMSE and WIS improvements versus NREL's ResStock) as direct numerical comparisons to an independent external model. The training-data assumption about gas-heated regions is stated explicitly as a premise rather than derived from any equation, fitted parameter, or self-citation chain. No derivation steps, equations, or load-bearing self-references appear that would reduce the reported performance gains to tautological inputs by construction. The validation therefore remains self-contained against an outside standard.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
probabilistic deep learning models... trained on electricity consumption from a predominantly gas-heated region... RMSE scores 18.8% and 27.6% lower than those based on ResStock
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
The long-term strategy of the united states: Pathways to net-zero greenhouse gas emissions by 2050,
United States Government, “The long-term strategy of the united states: Pathways to net-zero greenhouse gas emissions by 2050,” 2021
work page 2050
-
[2]
Decarbonizing the US Economy by 2050: A National Blueprint for the Buildings Sector,
J. Langevin, E. Wilson, C. Snyder, R. Narayanamurthy, J. Miller, K. Kaplan, M. Reiner, R. Risser, M. Mahoney, J. Geyer, et al., “Decarbonizing the US Economy by 2050: A National Blueprint for the Buildings Sector,” 2024
work page 2050
-
[3]
Sources of greenhouse gas emissions,
United States Environmental Protection Agency, “Sources of greenhouse gas emissions,” 2025
work page 2025
-
[4]
2020 Residential U.S. Heat Pump Market Update,
Atlas Buildings Hub, “2020 Residential U.S. Heat Pump Market Update,” 2023
work page 2020
-
[5]
Electricity load implications of space heating decarbonization pathways,
M. Waite and V . Modi, “Electricity load implications of space heating decarbonization pathways,” Joule, vol. 4, no. 2, pp. 376– 394, 2020
work page 2020
-
[6]
U.S. Department of Energy, “End-use load profiles for the u.s. building stock: Methodology and results of model calibration, validation, and uncertainty quantification,” tech. rep., Department of Energy, 2022
work page 2022
-
[7]
ResStock 2024.2 Re- lease: Technical Documentation,
National Renewable Energy Laboratory, “ResStock 2024.2 Re- lease: Technical Documentation,” tech. rep., National Renewable Energy Laboratory, 2024
work page 2024
-
[8]
Comstock reference documentation: 2024 release 1,
National Renewable Energy Laboratory, “Comstock reference documentation: 2024 release 1,” tech. rep., National Renewable Energy Laboratory, 2024
work page 2024
-
[9]
R. Khorramfar, M. Santoni-Colvin, S. Amin, L. K. Norford, A. Botterud, and D. Mallapragada, “Cost-effective planning of decarbonized power-gas infrastructure to meet the challenges of heating electrification,” 2023
work page 2023
-
[10]
California Public Utilities Commission, Energy Division, “Elec- trification impacts study part i: Bottom-up load forecasting and system-level electrification impacts cost estimates,” tech. rep., California Public Utilities Commission, San Francisco, Califor- nia, 2023
work page 2023
-
[11]
Model america–data and models of every us building,
J. New, M. Adams, A. Berres, B. Bass, and N. Clinton, “Model america–data and models of every us building,” tech. rep., Oak Ridge National Laboratory, 2021
work page 2021
- [12]
-
[13]
Automatic building energy modeling (autobem),
Oak Ridge National Laboratory, “Automatic building energy modeling (autobem),” 2024
work page 2024
-
[14]
2023 long-term load forecasting workshop,
Energy Systems Integration Group (ESIG), “2023 long-term load forecasting workshop,” 2023. Avail- able at:https://www.esig.energy/event/ 2023-long-term-load-forecasting-workshop/
work page 2023
-
[15]
Degree days - energy explained,
U.S. Energy Information Administration (EIA), “Degree days - energy explained,” n.d. Available at:https://www. eia.gov/energyexplained/units-and-calculators/ degree-days.php
-
[16]
American Society of Heating, Refrigerating and Air- Conditioning Engineers, ASHRAE Handbook – Fundamentals. Atlanta, GA: ASHRAE, 2021
work page 2021
-
[17]
A global model of hourly space heating and cooling demand at multiple spatial scales,
I. Staffell, S. Pfenninger, and N. Johnson, “A global model of hourly space heating and cooling demand at multiple spatial scales,”Nature Energy, vol. 8, no. 12, pp. 1328–1344, 2023
work page 2023
-
[18]
L. Kannari, J. Kiljander, K. Piira, J. Piippo, and P. Koponen, “Building heat demand forecasting by training a common ma- chine learning model with physics-based simulator,”Forecasting, vol. 3, no. 2, pp. 290–302, 2021
work page 2021
-
[19]
Improved day ahead heating demand forecasting by online correction methods,
F. B ¨unning, P. Heer, R. S. Smith, and J. Lygeros, “Improved day ahead heating demand forecasting by online correction methods,” Energy and Buildings, vol. 211, p. 109821, 2020
work page 2020
-
[20]
Machine-learning-based multi-step heat demand forecasting in a district heating system,
P. Poto ˇcnik, P. ˇSkerl, and E. Govekar, “Machine-learning-based multi-step heat demand forecasting in a district heating system,” Energy and Buildings, vol. 233, p. 110673, 2021
work page 2021
-
[21]
A. T. Eseye and M. Lehtonen, “Short-term forecasting of heat demand of buildings for efficient and optimal energy man- agement based on integrated machine learning models,” IEEE Transactions on Industrial Informatics, vol. 16, no. 12, pp. 7743– 15 7755, 2020
work page 2020
-
[22]
Machine learning ap- proaches for estimating commercial building energy consump- tion,
C. Robinson, B. Dilkina, J. Hubbs, W. Zhang, S. Guhathakurta, M. A. Brown, and R. M. Pendyala, “Machine learning ap- proaches for estimating commercial building energy consump- tion,”Applied Energy, vol. 208, pp. 889–904, 2017
work page 2017
-
[23]
H. Deng, D. Fannon, and M. J. Eckelman, “Predictive modeling for us commercial building energy use: A comparison of existing statistical and machine learning algorithms using cbecs micro- data,”Energy and Buildings, vol. 163, pp. 34–43, 2018
work page 2018
-
[24]
Machine learning for early stage building energy prediction: Increment and enrich- ment,
M. M. Singh, S. Singaravel, and P. Geyer, “Machine learning for early stage building energy prediction: Increment and enrich- ment,”Applied Energy, vol. 304, p. 117787, 2021
work page 2021
-
[25]
R. Olu-Ajayi, H. Alaka, I. Sulaimon, F. Sunmola, and S. Ajayi, “Building energy consumption prediction for residential build- ings using deep learning and other machine learning techniques,” Journal of Building Engineering, vol. 45, p. 103406, 2022
work page 2022
-
[26]
Predicting energy consumption for residential buildings using ann through parametric modeling,
E. Elbeltagi and H. Wefki, “Predicting energy consumption for residential buildings using ann through parametric modeling,” Energy Reports, vol. 7, pp. 2534–2545, 2021
work page 2021
-
[27]
Z. Li, J. Dai, H. Chen, and B. Lin, “An ann-based fast building energy consumption prediction method for complex architectural form at the early design stage,” vol. 12, pp. 665–681, 2019
work page 2019
-
[28]
M. Zhang, M.-A. Millar, S. Chen, Y . Ren, Z. Yu, and J. Yu, “En- hancing hourly heat demand prediction through artificial neural networks: A national level case study,” Energy and AI, vol. 15, p. 100315, 2024
work page 2024
-
[29]
A comparison of models for forecasting the residential natural gas demand of an urban area,
P. Poto ˇcnik, J. ˇSilc, G. Papa, et al., “A comparison of models for forecasting the residential natural gas demand of an urban area,” Energy, vol. 167, pp. 511–522, 2019
work page 2019
-
[30]
W. Panek and T. Włodek, “Natural gas consumption forecasting based on the variability of external meteorological factors using machine learning algorithms,” Energies, vol. 15, no. 1, p. 348, 2022
work page 2022
-
[31]
M. Van Hove, M. Delghust, and J. Laverge, “Data-driven ma- chine learning model performance of real annual natural gas con- sumption in residential buildings,” in2022 Building Performance Analysis Conference and SimBuild, pp. 181–188, 2022
work page 2022
-
[32]
H. Su, E. Zio, J. Zhang, M. Xu, X. Li, and Z. Zhang, “A hy- brid hourly natural gas demand forecasting method based on the integration of wavelet transform and enhanced deep-rnn model,” Energy, vol. 178, pp. 585–597, 2019
work page 2019
-
[33]
Natural gas consump- tion forecast with mars and cmars models for residential users,
A. ¨Ozmen, Y . Yılmaz, and G.-W. Weber, “Natural gas consump- tion forecast with mars and cmars models for residential users,” Energy Economics, vol. 70, pp. 357–381, 2018
work page 2018
- [34]
-
[35]
World settlement footprint 3d-a first three-dimensional survey of the global building stock,
T. Esch, E. Brzoska, S. Dech, B. Leutner, D. Palacios-Lopez, A. Metz-Marconcini, M. Marconcini, A. Roth, and J. Zeidler, “World settlement footprint 3d-a first three-dimensional survey of the global building stock,” Remote sensing of environment, vol. 270, p. 112877, 2022
work page 2022
-
[36]
Global land use/land cover with sentinel 2 and deep learning,
K. Karra, C. Kontgis, Z. Statman-Weil, J. C. Mazzariello, M. Mathis, and S. P. Brumby, “Global land use/land cover with sentinel 2 and deep learning,” in 2021 IEEE international geoscience and remote sensing symposium IGARSS, pp. 4704– 4707, IEEE, 2021
work page 2021
-
[37]
Annual time series of global viirs nighttime lights derived from monthly averages: 2012 to 2019,
C. D. Elvidge, M. Zhizhin, T. Ghosh, F.-C. Hsu, and J. Taneja, “Annual time series of global viirs nighttime lights derived from monthly averages: 2012 to 2019,”Remote Sensing, vol. 13, no. 5, p. 922, 2021
work page 2012
-
[38]
Speedtest by Ookla Global Fixed and Mobile Net- work Performance Map Tiles,
Ookla, “Speedtest by Ookla Global Fixed and Mobile Net- work Performance Map Tiles,” 2022.https://github.com/ teamookla/ookla-open-data
work page 2022
-
[39]
Era5 hourly data on single levels from 1940 to present,
C. D. S. Copernicus Climate Change Service, “Era5 hourly data on single levels from 1940 to present,” Copernicus Climate Change Service (C3S) Climate Data Store (CDS), 2023
work page 1940
-
[40]
Thefuzz: Fuzzy string matching in python
SeatGeek, “Thefuzz: Fuzzy string matching in python.”https: //github.com/seatgeek/thefuzz, 2021
work page 2021
-
[41]
S. J. Lee, Multimodal Data Fusion for Estimating Electricity Access and Demand. PhD thesis, Massachusetts Institute of Tech- nology, 2023
work page 2023
-
[42]
Oikolab, “Weather downloader.” Online, 2024. Available: https://weatherdownloader.oikolab.com/app
work page 2024
-
[43]
NREL, “Basic tutorial.” Online, 2024. Available: https://resstock.readthedocs.io/en/v3.3.0/basic_ tutorial/index.html
work page 2024
-
[44]
2020 Public Use Microdata Areas (PUMA) Shapefile,
IPUMS USA, “2020 Public Use Microdata Areas (PUMA) Shapefile,” 2024. Available at:https://usa.ipums.org/ usa/volii/pumas20.shtml. 16
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.