Physics-Informed Neural Networks with Learnable Loss Balancing and Transfer Learning

Reza Pirayeshshirazinezhad

arxiv: 2605.05217 · v1 · submitted 2026-04-17 · 💻 cs.LG · cs.AI

Physics-Informed Neural Networks with Learnable Loss Balancing and Transfer Learning

Reza Pirayeshshirazinezhad This is my paper

Pith reviewed 2026-05-10 09:23 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords physics-informed neural networkslearnable loss balancingtransfer learningadaptive weightingdata scarcityheat transfer predictionscientific machine learning

0 comments

The pith

A learnable blending neuron in PINNs dynamically balances physics and data terms based on uncertainties for improved performance with scarce data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces an adaptive physics-informed neural network that uses a learnable blending neuron to automatically adjust the importance of physics residuals versus data losses according to their uncertainties. The goal is to eliminate manual tuning of loss weights, which often destabilizes training in traditional PINNs, especially when data is limited. Transfer learning is added to leverage pre-trained representations from similar problems and fine-tune on new ones with minimal data. In a test case predicting heat transfer in miniature heat sinks using just 87 data points from CFD simulations, the method achieves less than 8% error while surpassing shallow networks, kernel methods, and physics-only models. Readers might care because this offers a more robust way to incorporate physical laws into machine learning for scientific problems where gathering data is costly.

Core claim

The authors establish that replacing fixed or heuristic loss weights in PINNs with a learnable blending neuron enables the network to dynamically set the relative contributions of physics-based and data-driven terms according to their uncertainties. Paired with a transfer learning strategy that reuses domain representations, this framework delivers stable training and accurate predictions on data-scarce tasks, such as heat transfer modeling in liquid-metal heat sinks where errors stay below 8%.

What carries the argument

The learnable blending neuron, a component that computes dynamic weights for the combined loss function by considering the uncertainties of individual physics and data loss terms.

Load-bearing premise

That the learnable blending neuron will produce stable training and superior generalization on new physical systems rather than overfitting to the uncertainty patterns of the 87-point heat transfer dataset.

What would settle it

Demonstrating that when applied to a distinct physical system with different data characteristics, the adaptive PINN exhibits unstable training or errors exceeding those of fixed-weight baselines.

Figures

Figures reproduced from arXiv: 2605.05217 by Reza Pirayeshshirazinezhad.

**Figure 2.** Figure 2: Effect of transfer learning from different layers. The first-layer transfer achieves the lowest [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: KDE comparison of water and sodium Nusselt numbers. Sodium exhibits higher variance [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: Distribution of the learned physics coefficient neuron in the self-supervised PINN, cen [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 5.** Figure 5: GP and SVR predictions on holdout dataset. SVR–Bayesian achieves better fit than GP, [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗

**Figure 6.** Figure 6: Neural network predictions on holdout dataset. The self-supervised PINN provides the [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗

read the original abstract

We propose a self-supervised physics-informed neural network (PINN) framework that adaptively balances physics-based and data-driven supervision for scientific machine learning under data scarcity. Unlike prior PINNs that rely on fixed or heuristic weighting of physics residuals and data loss, our approach introduces a learnable blending neuron that dynamically adjusts the relative contribution of each term based on their uncertainties. This mechanism enables stable training and improved generalization without manual tuning. To further enhance efficiency, we integrate a transfer learning strategy that reuses representations from related domains and adapts them to new physical systems with limited data. We validate the framework for the prediction of heat transfer in liquid-metal miniature heat sinks using only 87 CFD datapoints, where the adaptive PINN achieves an error <8%, outperforming shallow neural networks, kernel methods, and physics-only baselines. Our framework provides a general recipe for embedding physics adaptively into neural networks, offering a robust and reproducible approach for data-scarce problems across various scientific domains, including fluid dynamics and material modeling.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The learnable blending neuron for adaptive PINN loss weighting plus transfer learning is a reasonable incremental step for data-scarce heat-transfer modeling, but the 87-point validation does not yet establish reliable generalization.

read the letter

The paper's main contribution is a PINN variant that adds a learnable blending neuron to adjust the weights between physics residuals and data loss terms dynamically, combined with transfer learning to handle very limited data. They apply this to predicting heat transfer in liquid-metal miniature heat sinks using 87 CFD points and report under 8% error while beating shallow networks, kernel methods, and physics-only baselines. The goal of reducing manual hyperparameter tuning for loss balancing is practical for engineering applications where data is expensive to generate. The transfer learning angle also makes sense for adapting models across related physical systems. That part of the work is straightforward and addresses a real pain point in PINN training. The soft spots are in the evidence and implementation transparency. The abstract gives no architecture details, no derivation showing how the neuron actually ties outputs to uncertainties rather than learning arbitrary weights, no ablations isolating the neuron's contribution from transfer learning, and no mention of error bars, cross-validation, or regularization on the extra parameters. With only 87 points, the reported gains could easily come from overfitting to dataset-specific patterns instead of learning a general balancing rule that transfers. The outperformance claims need the full methods section and more experiments on other systems to hold up. This is for people already working on PINNs for fluid dynamics or materials modeling with sparse data. A reader in that subfield could pick up the blending idea for their own experiments, but the current results are too preliminary for broader adoption. I would send it to peer review so the authors can add the missing details, ablations, and tests on additional datasets.

Referee Report

3 major / 0 minor

Summary. The manuscript proposes a self-supervised PINN framework that uses a learnable blending neuron to dynamically balance physics residuals and data losses according to estimated uncertainties, augmented by transfer learning to adapt representations across physical systems under data scarcity. It validates the approach on heat-transfer prediction for liquid-metal miniature heat sinks using only 87 CFD points, reporting error below 8% and outperformance over shallow neural networks, kernel methods, and physics-only baselines.

Significance. If the central claims hold after addressing the noted gaps, the work would contribute a practical mechanism for automating loss weighting in PINNs, a persistent practical challenge, while the transfer-learning component offers a route to leverage related domains for new systems. The emphasis on a small, engineering-relevant dataset is a strength for demonstrating applicability in data-scarce scientific ML; credit is due for attempting to ground the blending in uncertainty rather than heuristics.

major comments (3)

[Abstract] Abstract: the performance claim of error <8% and outperformance is presented without details on network architecture, the internal structure or uncertainty estimation inside the blending neuron, baseline implementations, cross-validation procedure, or error bars, rendering the central empirical claim impossible to assess.
[Methods] Methods: no mathematical formulation, pseudocode, or architectural diagram specifies how the blending neuron computes or uses uncertainties to set weights (as opposed to learning arbitrary coefficients); the added free parameters therefore risk reducing to a fitted weighting scheme on the 87-point dataset rather than enforcing the claimed uncertainty-based mechanism.
[Experiments] Experiments/Results: validation is restricted to a single heat-transfer task plus transfer learning; no ablation isolates the blending neuron's contribution from the transfer-learning component or from a standard PINN, and no additional physical systems are tested to support the generalization claim without manual tuning.

Simulated Author's Rebuttal

3 responses · 1 unresolved

We thank the referee for the constructive and detailed feedback. We address each major comment point by point below, indicating where we agree and the revisions we will implement.

read point-by-point responses

Referee: [Abstract] Abstract: the performance claim of error <8% and outperformance is presented without details on network architecture, the internal structure or uncertainty estimation inside the blending neuron, baseline implementations, cross-validation procedure, or error bars, rendering the central empirical claim impossible to assess.

Authors: We agree that the abstract is too concise to allow full assessment of the central claims. In the revised manuscript, we will expand the abstract with a brief description of the network architecture, the blending neuron's internal structure for uncertainty estimation, the baseline implementations, the cross-validation procedure, and the inclusion of error bars on reported errors. This will improve assessability without substantially increasing length. revision: yes
Referee: [Methods] Methods: no mathematical formulation, pseudocode, or architectural diagram specifies how the blending neuron computes or uses uncertainties to set weights (as opposed to learning arbitrary coefficients); the added free parameters therefore risk reducing to a fitted weighting scheme on the 87-point dataset rather than enforcing the claimed uncertainty-based mechanism.

Authors: This is a fair critique of the current presentation. The manuscript describes the blending neuron at a conceptual level but lacks the requested explicit details. In the revision, we will add the mathematical formulation showing how the neuron uses estimated uncertainties to compute weights, pseudocode for the loss balancing procedure, and an architectural diagram. These additions will demonstrate that the mechanism is uncertainty-driven rather than an arbitrary fit. revision: yes
Referee: [Experiments] Experiments/Results: validation is restricted to a single heat-transfer task plus transfer learning; no ablation isolates the blending neuron's contribution from the transfer-learning component or from a standard PINN, and no additional physical systems are tested to support the generalization claim without manual tuning.

Authors: We acknowledge that the experimental section would be strengthened by ablations and broader testing. We will add ablation experiments in the revision to isolate the blending neuron's contribution from both transfer learning and a standard PINN baseline. However, extending validation to multiple additional physical systems would require new datasets and experiments outside the current scope; we will instead expand the discussion to explain how the transfer learning results already demonstrate adaptation without manual tuning on the target task. revision: partial

standing simulated objections not resolved

Requirement to test on multiple additional physical systems beyond the current heat-transfer task and transfer learning setup, as this would necessitate substantial new data collection and experiments not performed in the original work.

Circularity Check

0 steps flagged

No circularity in derivation chain; results grounded in external data

full rationale

The paper defines an architectural extension (learnable blending neuron for loss weighting plus transfer learning) and reports empirical performance on an independent 87-point CFD heat-transfer dataset. No equations or claims are shown that algebraically reduce the reported error or generalization improvement to the model inputs by construction. The validation uses external benchmarks (shallow NNs, kernel methods, physics-only baselines) rather than internal self-consistency loops. No self-definitional steps, fitted parameters renamed as predictions, or load-bearing self-citations appear in the derivation.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 1 invented entities

The central claim rests on standard PINN assumptions plus two new elements whose behavior is not independently verified outside the reported experiment.

free parameters (1)

blending neuron parameters
Weights inside the learnable neuron are optimized during training to set the relative loss contributions.

axioms (2)

standard math Physics residuals can be evaluated via automatic differentiation inside the neural network
Core assumption of all PINNs; invoked when the physics loss term is formed.
domain assumption Representations learned on related physical domains transfer usefully to the target heat-transfer problem
Required for the transfer-learning component to reduce data needs.

invented entities (1)

learnable blending neuron no independent evidence
purpose: Dynamically compute loss weights from estimated uncertainties of physics and data terms
New architectural component introduced to replace fixed or heuristic weighting.

pith-pipeline@v0.9.0 · 5470 in / 1530 out tokens · 48560 ms · 2026-05-10T09:23:24.953268+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

30 extracted references · 30 canonical work pages

[1]

and Kahani, M

Baghban, A. and Kahani, M. and Nazari, M.A. and Ahmadi, M.H. and Yan, W.-M. , title =. International Journal of Heat and Mass Transfer , volume =

work page
[2]

and Bahadori, M

Baghban, A. and Bahadori, M. and Rozyn, J. and Lee, M. and Abbas, A. and Bahadori, A. and Rahimali, A. , title =. Applied Thermal Engineering , volume =

work page
[3]

and Kayfeci, M

Kurt, H. and Kayfeci, M. , title =. Applied Energy , volume =

work page
[4]

and Karimi, H

Yousefi, F. and Karimi, H. and Papari, M.M. , title =. Journal of Molecular Liquids , volume =

work page
[5]

and Güemes, A

Guastoni, L. and Güemes, A. and Ianiro, A. and Discetti, S. and Schlatter, P. and Azizpour, H. and Vinuesa, R. , title =. Journal of Fluid Mechanics , volume =

work page
[6]

Energies , volume =

Sharma, Pushan and Chung, Wai Tong and Akoush, Bassem and Ihme, Matthias , title =. Energies , volume =

work page
[7]

and Lee, J

Jeon, J. and Lee, J. and Kim, S.J. , title =. International Journal of Energy Research , volume =

work page
[8]

and Lee, J

Jeon, J. and Lee, J. and Eivazi, H. and Vinuesa, R. and Kim, S.J. , title =. 2206.06817 , archivePrefix =

work page arXiv
[9]

and Qi, Z

Zhuang, F. and Qi, Z. and Duan, K. and Xi, D. and Zhu, Y. and Zhu, H. and Xiong, H. and He, Q. , title =. Proceedings of the IEEE , volume =

work page
[10]

and Chen, K

Liu, J. and Chen, K. and Xu, G. and Sun, X. and Yan, M. and Diao, W. and Han, H. , title =. IEEE Geoscience and Remote Sensing Letters , volume =

work page
[11]

and Biedron, S

Pirayesh, R. and Biedron, S. and Cruz, J.A.D. and Güitrón, S.S. and Martínez-Ramón, M. , title =. Preprint , year =

work page
[12]

arXiv preprint arXiv:2509.05886 , year=

SPINN: An Optimal Self-Supervised Physics-Informed Neural Network Framework , author=. arXiv preprint arXiv:2509.05886 , year=

work page arXiv
[13]

and Verma, M.K

Bhattacharya, S. and Verma, M.K. and Bhattacharya, A. , title =. Physics of Fluids , volume =

work page
[14]

and Mahian, O

Tafarroj, M.M. and Mahian, O. and Kasaeian, A. and Sakamatapan, K. and Dalkilic, A.S. and Wongwises, S. , title =. International Communications in Heat and Mass Transfer , volume =

work page
[15]

, title =

Shan, G. , title =. BMC Medical Informatics and Decision Making , volume =

work page
[16]

and Cooper, W.O

Elmessiry, A. and Cooper, W.O. and Catron, T.F. and Karrass, J. and Zhang, Z. and Singh, M.P. , title =. JMIR Medical Informatics , volume =

work page
[17]

and Biedron, S.G

Pirayeshshirazinezhad, R. and Biedron, S.G. and Cruz, J.A.D. and Güitrón, S.S. and Martínez-Ramón, M. , title =. IEEE Access , volume =

work page
[18]

, title =

Pirayeshshirazinezhad, R. , title =

work page
[19]

and Cristianini, N

Shawe-Taylor, J. and Cristianini, N. , title =

work page
[20]

and Tukey, J.W

McGill, R. and Tukey, J.W. and Larsen, W.A. , title =. The American Statistician , volume =

work page
[21]

and Whitney, D.R

Mann, H.B. and Whitney, D.R. , title =. The Annals of Mathematical Statistics , pages =

work page
[22]

McClenny, U

McClenny, Levi and Braga-Neto, Ulisses , title =. 2009.04544 , archivePrefix =

work page arXiv 2009
[23]

International Conference on Learning Representations (ICLR) , year =

Wang, Sifan and Teng, Yujun and Perdikaris, Paris , title =. International Conference on Learning Representations (ICLR) , year =

work page
[24]

2102.06754 , archivePrefix =

Yang, Liu and Meng, Xuhui and Karniadakis, George Em , title =. 2102.06754 , archivePrefix =

work page arXiv
[25]

IEEe Access , volume=

Designing Monte Carlo simulation and an optimal machine learning to optimize and model space missions , author=. IEEe Access , volume=. 2022 , publisher=

work page 2022
[26]

arXiv preprint arXiv:2509.15491 , year=

Explainable AI-Enhanced Supervisory Control for Robust Multi-Agent Robotic Systems , author=. arXiv preprint arXiv:2509.15491 , year=

work page arXiv
[27]

2022 , institution=

Machine Learning-Based Tuning of Control Parameters for LLRF System of Superconducting Cavities , author=. 2022 , institution=

work page 2022
[28]

2022 , school=

Artificial intelligence, controls, and sensor fusion for optimization and modeling of space missions and particle accelerators , author=. 2022 , school=

work page 2022
[29]

arXiv preprint arXiv:1910.07648 , year=

Studies in applying machine learning to LLRF and resonance control in superconducting RF cavities , author=. arXiv preprint arXiv:1910.07648 , year=

work page arXiv 1910
[30]

12th International Particle Accelerator Conference (IPAC'21), Campinas, SP, Brazil, 24-28 May 2021 , pages=

Hierarchical intelligent real-time optimal control for LLRF using time series machine learning methods and transfer learning , author=. 12th International Particle Accelerator Conference (IPAC'21), Campinas, SP, Brazil, 24-28 May 2021 , pages=. 2021 , organization=

work page 2021

[1] [1]

and Kahani, M

Baghban, A. and Kahani, M. and Nazari, M.A. and Ahmadi, M.H. and Yan, W.-M. , title =. International Journal of Heat and Mass Transfer , volume =

work page

[2] [2]

and Bahadori, M

Baghban, A. and Bahadori, M. and Rozyn, J. and Lee, M. and Abbas, A. and Bahadori, A. and Rahimali, A. , title =. Applied Thermal Engineering , volume =

work page

[3] [3]

and Kayfeci, M

Kurt, H. and Kayfeci, M. , title =. Applied Energy , volume =

work page

[4] [4]

and Karimi, H

Yousefi, F. and Karimi, H. and Papari, M.M. , title =. Journal of Molecular Liquids , volume =

work page

[5] [5]

and Güemes, A

Guastoni, L. and Güemes, A. and Ianiro, A. and Discetti, S. and Schlatter, P. and Azizpour, H. and Vinuesa, R. , title =. Journal of Fluid Mechanics , volume =

work page

[6] [6]

Energies , volume =

Sharma, Pushan and Chung, Wai Tong and Akoush, Bassem and Ihme, Matthias , title =. Energies , volume =

work page

[7] [7]

and Lee, J

Jeon, J. and Lee, J. and Kim, S.J. , title =. International Journal of Energy Research , volume =

work page

[8] [8]

and Lee, J

Jeon, J. and Lee, J. and Eivazi, H. and Vinuesa, R. and Kim, S.J. , title =. 2206.06817 , archivePrefix =

work page arXiv

[9] [9]

and Qi, Z

Zhuang, F. and Qi, Z. and Duan, K. and Xi, D. and Zhu, Y. and Zhu, H. and Xiong, H. and He, Q. , title =. Proceedings of the IEEE , volume =

work page

[10] [10]

and Chen, K

Liu, J. and Chen, K. and Xu, G. and Sun, X. and Yan, M. and Diao, W. and Han, H. , title =. IEEE Geoscience and Remote Sensing Letters , volume =

work page

[11] [11]

and Biedron, S

Pirayesh, R. and Biedron, S. and Cruz, J.A.D. and Güitrón, S.S. and Martínez-Ramón, M. , title =. Preprint , year =

work page

[12] [12]

arXiv preprint arXiv:2509.05886 , year=

SPINN: An Optimal Self-Supervised Physics-Informed Neural Network Framework , author=. arXiv preprint arXiv:2509.05886 , year=

work page arXiv

[13] [13]

and Verma, M.K

Bhattacharya, S. and Verma, M.K. and Bhattacharya, A. , title =. Physics of Fluids , volume =

work page

[14] [14]

and Mahian, O

Tafarroj, M.M. and Mahian, O. and Kasaeian, A. and Sakamatapan, K. and Dalkilic, A.S. and Wongwises, S. , title =. International Communications in Heat and Mass Transfer , volume =

work page

[15] [15]

, title =

Shan, G. , title =. BMC Medical Informatics and Decision Making , volume =

work page

[16] [16]

and Cooper, W.O

Elmessiry, A. and Cooper, W.O. and Catron, T.F. and Karrass, J. and Zhang, Z. and Singh, M.P. , title =. JMIR Medical Informatics , volume =

work page

[17] [17]

and Biedron, S.G

Pirayeshshirazinezhad, R. and Biedron, S.G. and Cruz, J.A.D. and Güitrón, S.S. and Martínez-Ramón, M. , title =. IEEE Access , volume =

work page

[18] [18]

, title =

Pirayeshshirazinezhad, R. , title =

work page

[19] [19]

and Cristianini, N

Shawe-Taylor, J. and Cristianini, N. , title =

work page

[20] [20]

and Tukey, J.W

McGill, R. and Tukey, J.W. and Larsen, W.A. , title =. The American Statistician , volume =

work page

[21] [21]

and Whitney, D.R

Mann, H.B. and Whitney, D.R. , title =. The Annals of Mathematical Statistics , pages =

work page

[22] [22]

McClenny, U

McClenny, Levi and Braga-Neto, Ulisses , title =. 2009.04544 , archivePrefix =

work page arXiv 2009

[23] [23]

International Conference on Learning Representations (ICLR) , year =

Wang, Sifan and Teng, Yujun and Perdikaris, Paris , title =. International Conference on Learning Representations (ICLR) , year =

work page

[24] [24]

2102.06754 , archivePrefix =

Yang, Liu and Meng, Xuhui and Karniadakis, George Em , title =. 2102.06754 , archivePrefix =

work page arXiv

[25] [25]

IEEe Access , volume=

Designing Monte Carlo simulation and an optimal machine learning to optimize and model space missions , author=. IEEe Access , volume=. 2022 , publisher=

work page 2022

[26] [26]

arXiv preprint arXiv:2509.15491 , year=

Explainable AI-Enhanced Supervisory Control for Robust Multi-Agent Robotic Systems , author=. arXiv preprint arXiv:2509.15491 , year=

work page arXiv

[27] [27]

2022 , institution=

Machine Learning-Based Tuning of Control Parameters for LLRF System of Superconducting Cavities , author=. 2022 , institution=

work page 2022

[28] [28]

2022 , school=

Artificial intelligence, controls, and sensor fusion for optimization and modeling of space missions and particle accelerators , author=. 2022 , school=

work page 2022

[29] [29]

arXiv preprint arXiv:1910.07648 , year=

Studies in applying machine learning to LLRF and resonance control in superconducting RF cavities , author=. arXiv preprint arXiv:1910.07648 , year=

work page arXiv 1910

[30] [30]

12th International Particle Accelerator Conference (IPAC'21), Campinas, SP, Brazil, 24-28 May 2021 , pages=

Hierarchical intelligent real-time optimal control for LLRF using time series machine learning methods and transfer learning , author=. 12th International Particle Accelerator Conference (IPAC'21), Campinas, SP, Brazil, 24-28 May 2021 , pages=. 2021 , organization=

work page 2021