pith. sign in

arxiv: 2605.22390 · v1 · pith:KINO7LXXnew · submitted 2026-05-21 · 💻 cs.LG

A Posterior-Predictive Variance Decomposition for Epistemic and Aleatoric Uncertainty in Wind Power Forecasting

Pith reviewed 2026-05-22 07:41 UTC · model grok-4.3

classification 💻 cs.LG
keywords wind power forecastinguncertainty decompositionaleatoric uncertaintyepistemic uncertaintyheteroscedastic regressionBayesian neural networksvariance decomposition
0
0 comments X

The pith

Applying the law of total variance separates aleatoric and epistemic uncertainty in wind power neural network forecasts.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a decomposition that breaks total predictive uncertainty in wind power forecasting into aleatoric uncertainty arising from data noise and epistemic uncertainty arising from limited model knowledge. A reader would care because this split indicates whether uncertainty can be reduced by collecting more data or by refining the model. The derivation combines the law of total variance with heteroscedastic neural network regression and Bayesian posterior approximation. The resulting estimators integrate with standard posterior methods and with beta-NLL training to control the mean-variance trade-off. Validation relies on synthetic experiments that inject controlled noise and shifts, property-driven checks on real SCADA turbine data, and dataset-size scaling to confirm expected asymptotic behavior of epistemic uncertainty.

Core claim

By applying the law of total variance to the joint setting of heteroscedastic neural network regression and Bayesian posterior approximation, an explicit decomposition of total uncertainty into aleatoric and epistemic components is derived. The resulting estimators are compatible with standard posterior-approximation methods and with beta-NLL training to regulate the mean-variance learning trade-off.

What carries the argument

Law of total variance decomposition applied to the predictive distribution of a heteroscedastic Bayesian neural network regressor.

If this is right

  • Decomposed aleatoric and epistemic components respond in theoretically expected directions to heteroscedastic noise, distributional shift, and training-scale changes.
  • Epistemic uncertainty decreases with larger training datasets while aleatoric uncertainty remains stable.
  • The estimators integrate directly with existing posterior approximation techniques and beta-NLL training without modification.
  • Real-world SCADA validation on wind turbine data supports operational utility through data-property-driven checks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Grid operators could use the aleatoric part to set fixed operating reserves and the epistemic part to decide when to retrain models on new data.
  • The same variance decomposition approach could be examined for solar generation or electricity load forecasting.
  • The label-free evaluation protocol might serve as a template for validating uncertainty splits in other regression domains.

Load-bearing premise

The three-module evaluation framework of synthetic experiments, SCADA data-property validation, and dataset-size scaling can confirm correct disentanglement of aleatoric and epistemic uncertainty without ground-truth uncertainty labels.

What would settle it

If adding known heteroscedastic noise to synthetic wind data fails to increase the estimated aleatoric component, or if epistemic uncertainty does not decrease as training set size grows, the claimed decomposition would be falsified.

Figures

Figures reproduced from arXiv: 2605.22390 by Kashem M. Muttaqi, Samson S. Yu, Yinsong Chen.

Figure 1
Figure 1. Figure 1: A conceptual illustration of uncertainty disentangle [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Uncertainty disentanglement on heteroscedastic sine using different posterior approximations with [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Uncertainty disentanglement on heteroscedastic sine using different posterior approximations with [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Uncertainty disentanglement on heteroscedastic sine using different posterior approximations with [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Joint distribution of wind speed and power output in [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Uncertainty disentanglement on a SCADA dataset using MC-DropConnect with [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Uncertainty disentanglement on a SCADA dataset using MC-DropConnect with [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Uncertainty disentanglement on a SCADA dataset using Bayes by Backprop with [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Uncertainty disentanglement on a SCADA dataset using Bayes by Backprop with [PITH_FULL_IMAGE:figures/full_fig_p008_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Uncertainty disentanglement on a SCADA dataset using 5 Ensembles with [PITH_FULL_IMAGE:figures/full_fig_p008_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Uncertainty disentanglement on a SCADA dataset using 5 Ensembles with [PITH_FULL_IMAGE:figures/full_fig_p008_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Uncertainty disentanglement across dataset sizes for different inference methods on wind power forecasting. [PITH_FULL_IMAGE:figures/full_fig_p010_12.png] view at source ↗
read the original abstract

Accurate wind power forecasting requires reliable uncertainty quantification, yet most existing methods report a single predictive uncertainty that conflates epistemic and aleatoric sources. This paper applies the law of total variance to the joint setting of heteroscedastic neural network regression and Bayesian posterior approximation, deriving an explicit decomposition of total uncertainty (TU) into aleatoric (AU) and epistemic (EU) components. The resulting estimators are compatible with standard posterior-approximation methods and with $\beta$-NLL training to regulate the mean--variance learning trade-off. A wind power--specific evaluation framework is proposed to validate disentanglement without access to ground-truth uncertainty labels, comprising three modules: controlled synthetic experiments to verify responses to heteroscedastic noise and distribution shift; data-property--driven validation on a real-world wind turbine SCADA dataset; and dataset-size scaling experiments to examine the predicted asymptotic behavior of EU. Across synthetic and real-world experiments, the decomposed AU and EU components respond in theoretically consistent directions to noise structure, distributional shift, and training-scale variation, supporting the theoretical consistency and operational utility of the proposed decomposition and evaluation protocol.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper derives an explicit decomposition of total predictive uncertainty into aleatoric (AU) and epistemic (EU) components for heteroscedastic neural network regression in wind power forecasting by applying the law of total variance to the posterior predictive. The resulting estimators are compatible with standard posterior-approximation methods and with β-NLL training. A three-module evaluation framework is proposed—controlled synthetic experiments, data-property-driven validation on a real-world wind turbine SCADA dataset, and dataset-size scaling experiments—to validate the disentanglement in the absence of ground-truth uncertainty labels. Across experiments, the decomposed AU and EU components respond in theoretically consistent directions to noise structure, distributional shift, and training-scale variation.

Significance. If validated, the decomposition offers a principled, operationally useful separation of uncertainty sources for wind power forecasting, where distinguishing aleatoric variability from model uncertainty can inform better grid management and risk assessment. The explicit derivation from the law of total variance, compatibility with existing training and inference techniques, and domain-specific evaluation protocol are strengths. However, the empirical support rests on directional consistency rather than quantitative recovery, which limits the strength of the conclusions.

major comments (2)
  1. [Evaluation Framework] The three-module evaluation framework (synthetic experiments, SCADA validation, and scaling) relies on directional consistency of AU/EU responses to noise, shift, and scale. This is load-bearing for the central claim of correct disentanglement, yet the synthetic module does not report numerical recovery error against ground-truth AU/EU values generated under controlled heteroscedastic noise; many alternative decompositions could produce similar monotonic behaviors.
  2. [Abstract] The abstract claims that experiments support theoretical consistency, but provides no quantitative results, error bars, or details on post-hoc choices in the validation modules. This weakens assessment of whether the reported AU and EU behaviors robustly confirm the decomposition rather than reflecting implementation artifacts.
minor comments (2)
  1. [Method] Clarify the exact form of the posterior approximation (e.g., which variational or sampling method) and how it interfaces with the variance decomposition in the derivation.
  2. [Related Work] Add a brief comparison to prior uncertainty decomposition methods in the related-work section to highlight the specific contribution of the posterior-predictive formulation.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback on our manuscript. We address each major comment below and outline revisions that will strengthen the presentation of our evaluation framework and abstract.

read point-by-point responses
  1. Referee: [Evaluation Framework] The three-module evaluation framework (synthetic experiments, SCADA validation, and scaling) relies on directional consistency of AU/EU responses to noise, shift, and scale. This is load-bearing for the central claim of correct disentanglement, yet the synthetic module does not report numerical recovery error against ground-truth AU/EU values generated under controlled heteroscedastic noise; many alternative decompositions could produce similar monotonic behaviors.

    Authors: We agree that quantitative recovery metrics would provide stronger evidence. In the synthetic module we generate data from known heteroscedastic processes, which in principle permits direct comparison of estimated AU to the injected noise variance. We will add a new subsection reporting mean absolute percentage error between estimated AU and ground-truth noise variance across noise levels, as well as the correlation between estimated EU and a proxy (e.g., posterior variance on held-out synthetic replicates). This addition will also discuss why exact ground-truth EU is inherently model-dependent even in simulation, thereby distinguishing our decomposition from alternatives that might exhibit similar monotonic trends. revision: yes

  2. Referee: [Abstract] The abstract claims that experiments support theoretical consistency, but provides no quantitative results, error bars, or details on post-hoc choices in the validation modules. This weakens assessment of whether the reported AU and EU behaviors robustly confirm the decomposition rather than reflecting implementation artifacts.

    Authors: We accept this criticism. The revised abstract will be expanded to include concise quantitative statements (e.g., “AU increased by X% under doubled noise variance while EU remained stable; EU decreased by Y% when training data were scaled from N to 4N”) together with a note that all reported trends were obtained from five independent random seeds with standard-error bars. We will also state that the three validation modules were pre-specified from theoretical predictions rather than chosen after observing results. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation applies standard law of total variance

full rationale

The paper's central derivation applies the law of total variance to the posterior predictive under heteroscedastic regression and Bayesian approximation, yielding an explicit TU = AU + EU split. This identity is an external statistical fact independent of the paper's fitted parameters, network architecture, or β-NLL training. The resulting estimators are stated to be compatible with existing posterior methods rather than being redefined in terms of the outputs. The three-module evaluation protocol (synthetic response tests, SCADA property checks, and scaling) is an empirical validation step that does not feed back into the decomposition equations themselves. No self-citation load-bearing steps, uniqueness theorems, or fitted-input-renamed-as-prediction patterns appear in the derivation chain. The framework therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The derivation rests on the law of total variance and standard assumptions of heteroscedastic regression and Bayesian approximation; no free parameters, invented entities, or ad-hoc axioms are visible in the abstract.

axioms (1)
  • standard math Law of total variance decomposes posterior predictive variance into aleatoric and epistemic components
    Invoked to obtain explicit AU and EU estimators from the joint heteroscedastic-Bayesian setting.

pith-pipeline@v0.9.0 · 5736 in / 1265 out tokens · 26950 ms · 2026-05-22T07:41:05.236856+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

29 extracted references · 29 canonical work pages

  1. [1]

    Decomposition- based wind power forecasting models and their boundary issue: An in-depth review and comprehensive discussion on potential solutions,

    Y . Chen, S. Yu, S. Islam, C. P. Lim, and S. Muyeen, “Decomposition- based wind power forecasting models and their boundary issue: An in-depth review and comprehensive discussion on potential solutions,” Energy Reports, vol. 8, pp. 8805–8820, 2022

  2. [2]

    Multi-objective estimation of optimal prediction intervals for wind power forecasting,

    Y . Chen, S. S. Yu, C. P. Lim, and P. Shi, “Multi-objective estimation of optimal prediction intervals for wind power forecasting,”IEEE Transactions on Sustainable Energy, vol. 15, no. 2, pp. 974–985, 2024

  3. [3]

    Uncovering wind power forecasting uncertainty sources and their prop- agation through the whole modelling chain,

    J. Yan, C. M ¨ohrlen, T. G ¨oc ¸men, M. Kelly, A. Wessel, and G. Giebel, “Uncovering wind power forecasting uncertainty sources and their prop- agation through the whole modelling chain,”Renewable and Sustainable Energy Reviews, vol. 165, p. 112519, 2022

  4. [4]

    Offshore wind power system economic evaluation framework under aleatory and epistemic uncertainty,

    A. C. Caputo, A. Federici, P. M. Pelagagge, and P. Salini, “Offshore wind power system economic evaluation framework under aleatory and epistemic uncertainty,”Applied Energy, vol. 350, p. 121585, 2023

  5. [5]

    Ultra-short-term wind power forecasting based on deep bayesian model with uncertainty,

    L. Liu, J. Liu, Y . Ye, H. Liu, K. Chen, D. Li, X. Dong, and M. Sun, “Ultra-short-term wind power forecasting based on deep bayesian model with uncertainty,”Renewable Energy, vol. 205, pp. 598–607, 2023

  6. [6]

    Mixed aleatory-epistemic uncertainty modeling of wind power forecast errors in operation reliability evaluation of power systems,

    J. Ding, K. Xie, B. Hu, C. Shao, T. Niu, C. Li, and C. Pan, “Mixed aleatory-epistemic uncertainty modeling of wind power forecast errors in operation reliability evaluation of power systems,”Journal of Modern Power Systems and Clean Energy, vol. 10, no. 5, pp. 1174–1183, 2022

  7. [7]

    Deep deterministic uncertainty: A new simple baseline,

    J. Mukhoti, A. Kirsch, J. van Amersfoort, P. H. Torr, and Y . Gal, “Deep deterministic uncertainty: A new simple baseline,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 24 384–24 394

  8. [8]

    Prioritized training on points that are learnable, worth learning, and not yet learnt,

    S. Mindermann, J. M. Brauner, M. T. Razzak, M. Sharma, A. Kirsch, W. Xu, B. H ¨oltgen, A. N. Gomez, A. Morisot, S. Farquharet al., “Prioritized training on points that are learnable, worth learning, and not yet learnt,” inInternational Conference on Machine Learning. PMLR, 2022, pp. 15 630–15 649

  9. [9]

    Aleatoric and epistemic uncertainty in machine learning: An introduction to concepts and methods,

    E. H ¨ullermeier and W. Waegeman, “Aleatoric and epistemic uncertainty in machine learning: An introduction to concepts and methods,”Machine learning, vol. 110, no. 3, pp. 457–506, 2021

  10. [10]

    What uncertainties do we need in bayesian deep learning for computer vision?

    A. Kendall and Y . Gal, “What uncertainties do we need in bayesian deep learning for computer vision?”Advances in neural information processing systems, vol. 30, 2017

  11. [11]

    On the pitfalls of het- eroscedastic uncertainty estimation with probabilistic neural networks,

    M. Seitzer, A. Tavakoli, D. Antic, and G. Martius, “On the pitfalls of het- eroscedastic uncertainty estimation with probabilistic neural networks,” inTenth International Conference on Learning Representations (ICLR 2022), 2022

  12. [12]

    Simple and scalable predictive uncertainty estimation using deep ensembles,

    B. Lakshminarayanan, A. Pritzel, and C. Blundell, “Simple and scalable predictive uncertainty estimation using deep ensembles,”Advances in neural information processing systems, vol. 30, 2017

  13. [13]

    Benchmarking uncertainty dis- entanglement: Specialized uncertainties for specialized tasks,

    B. Mucs ´anyi, M. Kirchhof, and S. J. Oh, “Benchmarking uncertainty dis- entanglement: Specialized uncertainties for specialized tasks,”Advances in neural information processing systems, vol. 37, pp. 50 972–51 038, 2024

  14. [14]

    A deeper look into aleatoric and epistemic uncertainty disentanglement,

    M. Valdenegro-Toro and D. S. Mori, “A deeper look into aleatoric and epistemic uncertainty disentanglement,” in2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, 2022, pp. 1508–1516

  15. [15]

    Aleatory or epistemic? does it matter?

    A. Der Kiureghian and O. Ditlevsen, “Aleatory or epistemic? does it matter?”Structural safety, vol. 31, no. 2, pp. 105–112, 2009

  16. [16]

    A review and discussion of decomposition-based hybrid models for wind energy forecasting applications,

    Z. Qian, Y . Pei, H. Zareipour, and N. Chen, “A review and discussion of decomposition-based hybrid models for wind energy forecasting applications,”Applied Energy, vol. 235, pp. 939–953, 2019

  17. [17]

    Interplay between bayesian neural networks and deep learning: A survey,

    Y . Chen, S. Y . Samson, Z. Li, J. K. Eshraghian, and C. P. Lim, “Interplay between bayesian neural networks and deep learning: A survey,”Knowledge-Based Systems, vol. 330, p. 114438, 2025

  18. [18]

    Heteroscedastic gaussian process regression,

    Q. V . Le, A. J. Smola, and S. Canu, “Heteroscedastic gaussian process regression,” inProceedings of the 22nd international conference on Machine learning, 2005, pp. 489–496

  19. [19]

    K. L. Chung,A course in probability theory. Elsevier, 2000

  20. [20]

    Ghosal and A

    S. Ghosal and A. W. Van der Vaart,Fundamentals of nonparametric Bayesian inference. Cambridge University Press, 2017, vol. 44

  21. [21]

    Rudin,Principles of mathematical analysis

    W. Rudin,Principles of mathematical analysis. Academic Press, 2021

  22. [22]

    Deep ensembles: A loss landscape perspective.arXiv preprint arXiv:1912.02757,

    S. Fort, H. Hu, and B. Lakshminarayanan, “Deep ensembles: A loss landscape perspective,”arXiv preprint arXiv:1912.02757, 2019

  23. [23]

    Quan- tifying aleatoric and epistemic uncertainty in machine learning: Are conditional entropy and mutual information appropriate measures?

    L. Wimmer, Y . Sale, P. Hofman, B. Bischl, and E. H ¨ullermeier, “Quan- tifying aleatoric and epistemic uncertainty in machine learning: Are conditional entropy and mutual information appropriate measures?” in Uncertainty in Artificial Intelligence. PMLR, 2023, pp. 2282–2292

  24. [24]

    Measuring uncertainty disentanglement error in classification,

    I. P. de Jong, A. I. Sburlea, and M. Valdenegro-Toro, “Measuring uncertainty disentanglement error in classification,”arXiv preprint arXiv:2408.12175, 2024

  25. [25]

    Probabilistic modelling of wind turbine power curves with application of heteroscedastic gaussian process regression,

    T. Rogers, P. Gardner, N. Dervilis, K. Worden, A. Maguire, E. Pap- atheou, and E. Cross, “Probabilistic modelling of wind turbine power curves with application of heteroscedastic gaussian process regression,” Renewable Energy, vol. 148, pp. 1124–1136, 2020

  26. [26]

    Dropconnect is effective in modeling uncertainty of bayesian deep networks,

    A. Mobiny, P. Yuan, S. K. Moulik, N. Garg, C. C. Wu, and H. Van Nguyen, “Dropconnect is effective in modeling uncertainty of bayesian deep networks,”Scientific reports, vol. 11, no. 1, p. 5458, 2021

  27. [27]

    B. Erisen. (2018) Wind turbine scada dataset. [Online]. Available: https: //www.kaggle.com/datasets/berkerisen/wind-turbine-scada-dataset/data

  28. [28]

    Probabilistic neural network to quantify uncertainty of wind power estimation,

    F. Karami, N. Kehtarnavaz, and M. Rotea, “Probabilistic neural network to quantify uncertainty of wind power estimation,” in2022 IEEE 15th Dallas Circuit And System Conference (DCAS). IEEE, 2022, pp. 1–7

  29. [29]

    Using bias-corrected reanalysis to simulate current and future wind power output,

    I. Staffell and S. Pfenninger, “Using bias-corrected reanalysis to simulate current and future wind power output,”Energy, vol. 114, pp. 1224–1239, 2016