Physics-Informed Attention Mechanism and Generalization Capability of Deep Learning-Based Grain Growth Evolution Prediction

Marc Bernacki; Pungponhavoan Tep

arxiv: 2606.17235 · v1 · pith:7UFGYZSQnew · submitted 2026-06-15 · ❄️ cond-mat.mtrl-sci · cs.AI

Physics-Informed Attention Mechanism and Generalization Capability of Deep Learning-Based Grain Growth Evolution Prediction

Pungponhavoan Tep , Marc Bernacki This is my paper

Pith reviewed 2026-06-27 02:36 UTC · model grok-4.3

classification ❄️ cond-mat.mtrl-sci cs.AI

keywords grain growthdeep learningphysics-informed attentionout-of-distribution generalizationmicrostructure evolutionmaterials scienceattention mechanism

0 comments

The pith

Boundary-masked attention improves generalization of synthetic-trained grain growth models to experimental and bimodal microstructures without retraining.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper tests whether deep learning models for grain growth, trained only on idealized synthetic data, can handle real experimental microstructures and unusual grain size distributions. It adds a boundary-masked attention mechanism that limits attention to grain boundary pixels, reflecting the physics of curvature-driven growth. Both the baseline and the modified model generalize to three out-of-distribution test cases, but the physics-informed version shows clear gains, especially for bimodal grain distributions where SSIM rises from 0.6221 to 0.7609 and mean grain size error drops from 8.75% to 3.57%. Attention maps reveal the model focuses on large boundaries in a manner consistent with known grain growth physics.

Core claim

Both the baseline and the proposed physics-informed attention model were evaluated without retraining or fine-tuning on the OOD data. Both models successfully generalized to all three test cases, yet the boundary-masked attention mechanism provided substantial improvements, with the most notable gains for microstructures characterized by a bimodal grain size distribution, where Structural Similarity Index Measure (SSIM) improved from 0.6221 to 0.7609 and mean grain size error decreased from 8.75% to 3.57%. The attention heatmap analysis revealed that the boundary-masked attention model learned to concentrate attention on large grain boundaries in a manner consistent with curvature-driven gra

What carries the argument

The boundary-masked attention mechanism, which constrains attention to grain boundary pixels to incorporate grain growth physics into the model.

If this is right

Models trained solely on synthetic grain growth data can be applied directly to experimental microstructures.
Physics-informed attention yields the largest accuracy gains on microstructures whose grain size distribution differs from the training set.
Attention patterns consistent with curvature-driven growth emerge automatically during training on synthetic data.
Similar boundary constraints could be added to other deep learning models for microstructure evolution.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Attention masking may offer a lightweight route to embed physical constraints in other materials simulation models without changing the loss function.
The approach could reduce reliance on repeated retraining when grain growth conditions shift in industrial processing.
Extending the mask to include additional physical features such as triple junctions might produce further robustness gains.

Load-bearing premise

The three chosen out-of-distribution test cases and the synthetic training data are representative of real deployment conditions, and SSIM plus mean grain size error adequately measure physical fidelity.

What would settle it

A new set of experimental grain growth microstructures outside the three tested OOD cases where the boundary-masked model shows equal or lower accuracy than the baseline on SSIM and grain size error.

Figures

Figures reproduced from arXiv: 2606.17235 by Marc Bernacki, Pungponhavoan Tep.

**Figure 2.** Figure 2: Microstructure images for Test Case 1 (experimental microstructures): (a) [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗

**Figure 3.** Figure 3: Microstructure images for Test Case 2 (synthetic bimodal microstructures): [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗

**Figure 4.** Figure 4: Microstructure images for Test Case 3 (abnormal grain growth): (a) initial state [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗

**Figure 5.** Figure 5: Initial state distributions (t = 0 min) for all three OOD test cases, where left panels show surface-weighted ECR distributions and right panels show grain neighbor count distributions: (a, b) Test Case 1 (experimental microstructures); (c, d) Test Case 2 (synthetic bimodal microstructures); (e, f) Test Case 3 (abnormal grain growth). which characterizes network connectivity by recording the number of boun… view at source ↗

**Figure 6.** Figure 6: Error heatmap for Test Case 1 (experimental microstructures) at [PITH_FULL_IMAGE:figures/full_fig_p014_6.png] view at source ↗

**Figure 7.** Figure 7: Distributions comparison between predicted and ground truth for Test Case 1 [PITH_FULL_IMAGE:figures/full_fig_p015_7.png] view at source ↗

**Figure 8.** Figure 8: Error heatmap for Test Case 2 (synthetic bimodal microstructures) at [PITH_FULL_IMAGE:figures/full_fig_p016_8.png] view at source ↗

**Figure 9.** Figure 9: Distributions comparison between predicted and ground truth for Test Case 2 [PITH_FULL_IMAGE:figures/full_fig_p017_9.png] view at source ↗

**Figure 10.** Figure 10: Error heatmap for Test Case 3 (abnormal grain growth) at [PITH_FULL_IMAGE:figures/full_fig_p018_10.png] view at source ↗

**Figure 11.** Figure 11: Distributions comparison between predicted and ground truth for Test Case 3 [PITH_FULL_IMAGE:figures/full_fig_p018_11.png] view at source ↗

**Figure 12.** Figure 12: Attention weight heatmaps generated by the boundary-masked attention mech [PITH_FULL_IMAGE:figures/full_fig_p027_12.png] view at source ↗

**Figure 13.** Figure 13: Evolution of the neighbor count for the single abnormal grain in Test Case 3 [PITH_FULL_IMAGE:figures/full_fig_p028_13.png] view at source ↗

read the original abstract

Machine Learning (ML) models for grain growth prediction are typically trained on idealized synthetic data, yet practical applications require generalization to conditions outside the training distribution. This study evaluated the Out-Of-Distribution (OOD) generalization capability of the trained model from our previous study across three test cases, including experimental microstructures, microstructures characterized by a bimodal grain size distribution, and abnormal grain growth. To further probe whether physics-informed architectural design could improve robustness under these different conditions, a boundary-masked attention mechanism was proposed specifically for grain growth, constraining attention to grain boundary pixels. Both the baseline and the proposed physics-informed attention model were evaluated without retraining or fine-tuning on the OOD data. Both models successfully generalized to all three test cases, yet the boundary-masked attention mechanism provided substantial improvements, with the most notable gains for microstructures characterized by a bimodal grain size distribution, where Structural Similarity Index Measure (SSIM) improved from \num{0.6221} to \num{0.7609} and mean grain size ($\overline{R}$) error decreased from \SI{8.75}{\percent} to \SI{3.57}{\percent}. The attention heatmap analysis revealed that the boundary-masked attention model learned to concentrate attention on large grain boundaries in a manner consistent with curvature-driven grain growth physics, emerging from training without being explicitly encoded into the architecture. These results indicate that models trained on synthetic data can generalize to diverse OOD conditions without retraining, and that physics-informed attention may improve accuracy when the boundary morphology matches the training domain.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Boundary-masked attention improves SSIM and size error on OOD grain growth cases, but the gains rest on image metrics rather than direct checks on curvature-driven kinetics.

read the letter

The paper takes their prior grain growth model, adds a boundary-masked attention layer, and runs both versions on three OOD regimes without retraining or fine-tuning. The masked version shows clear lifts, especially on bimodal microstructures where SSIM moves from 0.6221 to 0.7609 and mean grain size error falls from 8.75% to 3.57%. Both models handle experimental images, bimodal distributions, and abnormal growth.

What stands out is the concrete OOD testing and the observation that attention concentrates on large boundaries in a pattern consistent with curvature-driven growth. That is a useful data point for anyone trying to move synthetic-trained models toward real microstructures.

The soft spots are straightforward. SSIM and average radius error measure visual match and one aggregate number; they do not directly test whether boundary velocity follows curvature or whether topology evolves correctly under the von Neumann-Mullins relation. The attention analysis is post-hoc and qualitative. No error bars, run-to-run statistics, or training hyperparameter details appear in the abstract, and the baseline is the authors' own earlier work. These gaps make the strength of the physics-informed claim hard to judge from the numbers given.

The work is aimed at the computational materials modeling community that already uses ML for microstructure evolution. A reader who needs to know whether attention masks help generalization on real or non-ideal grain structures will get usable information. The OOD tests are specific enough that the paper deserves a serious referee rather than a desk reject.

Referee Report

3 major / 1 minor

Summary. The manuscript evaluates the out-of-distribution (OOD) generalization of a prior deep learning model for predicting grain growth evolution, trained on synthetic data. It proposes a boundary-masked attention mechanism that constrains attention to grain boundary pixels and reports that both the baseline and proposed models generalize without retraining to experimental microstructures, bimodal grain size distributions, and abnormal grain growth cases. The attention model yields gains, most notably on bimodal cases (SSIM 0.6221 to 0.7609; mean grain size error 8.75% to 3.57%), with qualitative attention heatmaps interpreted as consistent with curvature-driven physics.

Significance. If the central claims hold under more rigorous physical validation, the work would indicate that targeted architectural constraints drawn from materials physics can enhance robustness of microstructure evolution predictors beyond training distributions, potentially reducing the need for domain-specific retraining in experimental settings.

major comments (3)

[Abstract] Abstract: The headline generalization and 'physics-informed' benefit claims rest on SSIM and mean grain size error improvements, yet these metrics quantify image similarity and a single scalar aggregate rather than direct adherence to curvature-driven kinetics (e.g., von Neumann-Mullins relation dA/dt = k(n-6) or boundary velocity proportional to curvature); no such physical fidelity tests are reported, leaving the interpretation of the gains unsecured.
[Abstract] Abstract: The reported metric gains (SSIM 0.6221→0.7609 and mean grain size error 8.75%→3.57% on bimodal cases) are given without error bars, number of microstructures evaluated, or statistical significance tests, which is load-bearing for asserting 'substantial improvements' and successful generalization across all three OOD regimes.
[Abstract] Abstract: The attention heatmap analysis is described as showing concentration on large grain boundaries 'in a manner consistent with curvature-driven grain growth physics' emerging without explicit encoding, but the analysis is post-hoc and qualitative with no quantitative metric (e.g., correlation with local curvature or boundary velocity) supplied to support this interpretation.

minor comments (1)

The abstract references 'our previous study' for the baseline without restating key architectural or training details here; ensure the full manuscript makes the comparison fully self-contained for readers.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript arXiv:2606.17235. We address each of the major comments below and outline the revisions we will make to strengthen the paper.

read point-by-point responses

Referee: [Abstract] The headline generalization and 'physics-informed' benefit claims rest on SSIM and mean grain size error improvements, yet these metrics quantify image similarity and a single scalar aggregate rather than direct adherence to curvature-driven kinetics (e.g., von Neumann-Mullins relation); no such physical fidelity tests are reported.

Authors: We acknowledge that our evaluation relies on SSIM and mean grain size error, which are standard metrics for assessing prediction accuracy in image-based models but do not directly verify adherence to specific physical laws such as the von Neumann-Mullins relation. The primary goal of the study was to demonstrate OOD generalization using these practical metrics. However, we agree that this leaves the physics interpretation somewhat unsecured. In the revised manuscript, we will add a paragraph in the discussion section acknowledging this limitation and suggesting that future work could include direct comparisons to curvature-driven models. We will also tone down the abstract claims to reflect the metrics used. revision: yes
Referee: [Abstract] The reported metric gains (SSIM 0.6221→0.7609 and mean grain size error 8.75%→3.57% on bimodal cases) are given without error bars, number of microstructures evaluated, or statistical significance tests.

Authors: This is a valid point. The values reported in the abstract are mean values across the evaluated OOD cases, but we omitted the supporting details for conciseness. We will revise the abstract and the results section to specify the number of microstructures tested for each OOD regime, include error bars or standard deviations, and report any statistical significance tests that were performed on the improvements. revision: yes
Referee: [Abstract] The attention heatmap analysis is described as showing concentration on large grain boundaries 'in a manner consistent with curvature-driven grain growth physics' emerging without explicit encoding, but the analysis is post-hoc and qualitative with no quantitative metric supplied.

Authors: We agree that the attention analysis is qualitative and post-hoc. It was included to offer insight into why the boundary-masked attention improves performance, based on visual examination of the heatmaps. To address this, we will revise the relevant text to clearly state that the consistency with physics is an interpretive observation rather than a quantitatively validated claim. If time permits, we may explore adding a simple quantitative correlation metric between attention weights and local curvature in the revision. revision: partial

Circularity Check

0 steps flagged

No circularity: OOD generalization claims rest on direct empirical testing of independent data

full rationale

The paper trains or re-uses models on synthetic data and then measures SSIM and mean grain size error on three separate OOD test sets (experimental microstructures, bimodal distributions, abnormal grain growth) without retraining. These metrics are computed on held-out inputs and do not reduce to the training distribution by construction. The boundary-masked attention is an explicit architectural modification whose performance difference is reported as an empirical observation. Self-citation of the baseline model supplies the starting point but does not carry the load-bearing claim; the new evidence consists of the OOD results themselves. No self-definitional loop, fitted-input-as-prediction, uniqueness theorem, or ansatz smuggling appears in the reported chain.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No free parameters, axioms, or invented entities are identifiable from the abstract; the work relies on standard deep learning practices and prior model from the authors' previous study.

pith-pipeline@v0.9.1-grok · 5818 in / 1349 out tokens · 67691 ms · 2026-06-27T02:36:08.527343+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

21 extracted references · 1 linked inside Pith

[1]

M. Yang, L. Wang, W. Yan, Phase-field modeling of grain evolutions in additive manufacturing from nucleation, growth, to coarsening, npj Comput. Mater. 7 (1) (2021) 56

2021
[2]

Moelans, B

N. Moelans, B. Blanpain, P. Wollants, Quantitative analysis of grain boundary prop- erties in a generalized phase field model for grain growth in anisotropic systems, Phys. Rev. B 78 (2) (2008) 024113

2008
[3]

C. E. Krill III, L.-Q. Chen, Computer simulation of 3-d grain growth using a phase- field model, Acta Mater. 50 (12) (2002) 3059–3075

2002
[4]

Chen, Phase-field models for microstructure evolution, Annu

L.-Q. Chen, Phase-field models for microstructure evolution, Annu. Rev. Mater. Res. 32 (2002) 113–140

2002
[5]

Steinbach, Phase-field models in materials science, Modell

I. Steinbach, Phase-field models in materials science, Modell. Simul. Mater. Sci. Eng. 17 (7) (2009) 073001

2009
[7]

Murgas, S

B. Murgas, S. Florez, N. Bozzolo, J. Fausty, M. Bernacki, Comparative study and limits of different level-set formulations for the modeling of anisotropic grain growth, Mater. 14 (14) (2021) 3883

2021
[8]

P. Tep, M. Bernacki, High-fidelity grain growth modeling: Leveraging deep learning for fast computations, Acta Mater. 301 (2025) 121486

2025
[9]

Z. Wang, A. C. Bovik, H. R. Sheikh, E. P. Simoncelli, Image quality assessment: from error visibility to structural similarity, IEEE Trans. Image Process. 13 (4) (2004) 600– 612

2004
[10]

Z. Shen, J. Liu, Y. He, X. Zhang, R. Xu, H. Yu, P. Cui, Towards out-of-distribution generalization: A survey, arXiv preprint arXiv:2108.13624 (2021)

arXiv 2021
[11]

K. Li, A. N. Rubungo, X. Lei, D. Persaud, K. Choudhary, B. DeCost, A. B. Dieng, J. Hattrick-Simpers, Probing out-of-distribution generalization in machine learning for materials, Commun. Mater. 6 (1) (2024)

2024
[12]

Florez, K

S. Florez, K. Alvarado, D. P. Muñoz, M. Bernacki, A novel highly efficient lagrangian model for massively multidomain simulation applied to microstructural evolutions, Comput. Methods Appl. Mech. Eng. 367 (2020) 113107. 25

2020
[13]

Florez, J

S. Florez, J. Fausty, K. Alvarado, B. Murgas, M. Bernacki, Parallelization of an efficient 2d-lagrangian model for massive multi-domain simulations, Modell. Simul. Mater. Sci. Eng. 29 (6) (2021) 065005

2021
[14]

M. Bernacki, Kinetic equations and level-set approach for simulating solid-state mi- crostructure evolutions at the mesoscopic scale: State of the art, limitations, and prospects, Prog. Mater. Sci. 142 (2024) 101224

2024
[15]

Hitti, P

K. Hitti, P. Laure, T. Coupez, L. Silva, M. Bernacki, Precise generation of complex statistical representative volume elements (rves) in a finite element context, Comput. Mater. Sci. 61 (2012) 224–238

2012
[16]

Vaswani, N

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, I. Polosukhin, Attention is all you need, Adv. Neural Inf. Process. Syst. 30 (2017)

2017
[17]

Bahdanau, K

D. Bahdanau, K. Cho, Y. Bengio, Neural machine translation by jointly learning to align and translate, arXiv preprint arXiv:1409.0473 (2015)

Pith/arXiv arXiv 2015
[18]

C. E. Krill, E. A. Holm, J. M. Dake, R. Cohn, K. Holikova, F. Andorfer, Extreme abnormal grain growth: Connecting mechanisms to microstructural outcomes, Annu. Rev. Mater. Res. 53 (1) (2023) 319–345

2023
[19]

Rollett, G

A. Rollett, G. S. Rohrer, J. Humphreys, Recrystallization and Related Annealing Phenomena, 3rd Edition, Elsevier, 2017

2017
[20]

Kullback, R

S. Kullback, R. A. Leibler, On information and sufficiency, Ann. Math. Stat. 22 (1) (1951) 79–86

1951
[21]

Montavon, K.-R

G. Montavon, K.-R. Müller, Learning with wasserstein distances, in: Adv. Neural Inf. Process. Syst., 2013, pp. 689–697

2013
[22]

W. W. Mullins, Two-dimensional motion of idealized grain boundaries, J. Appl. Phys. 27 (8) (1956) 900–904. 26 (a) (b) (c) (d) (e) (f) 0.00 (background) 0.15 (boundary, low attention) 0.50 0.75 1.00 (max attention) Attention Score Figure12: Attentionweightheatmapsgeneratedbytheboundary-maskedattentionmech- anism at the first prediction step (t= 10 min) and...

1956

[1] [1]

M. Yang, L. Wang, W. Yan, Phase-field modeling of grain evolutions in additive manufacturing from nucleation, growth, to coarsening, npj Comput. Mater. 7 (1) (2021) 56

2021

[2] [2]

Moelans, B

N. Moelans, B. Blanpain, P. Wollants, Quantitative analysis of grain boundary prop- erties in a generalized phase field model for grain growth in anisotropic systems, Phys. Rev. B 78 (2) (2008) 024113

2008

[3] [3]

C. E. Krill III, L.-Q. Chen, Computer simulation of 3-d grain growth using a phase- field model, Acta Mater. 50 (12) (2002) 3059–3075

2002

[4] [4]

Chen, Phase-field models for microstructure evolution, Annu

L.-Q. Chen, Phase-field models for microstructure evolution, Annu. Rev. Mater. Res. 32 (2002) 113–140

2002

[5] [5]

Steinbach, Phase-field models in materials science, Modell

I. Steinbach, Phase-field models in materials science, Modell. Simul. Mater. Sci. Eng. 17 (7) (2009) 073001

2009

[6] [7]

Murgas, S

B. Murgas, S. Florez, N. Bozzolo, J. Fausty, M. Bernacki, Comparative study and limits of different level-set formulations for the modeling of anisotropic grain growth, Mater. 14 (14) (2021) 3883

2021

[7] [8]

P. Tep, M. Bernacki, High-fidelity grain growth modeling: Leveraging deep learning for fast computations, Acta Mater. 301 (2025) 121486

2025

[8] [9]

Z. Wang, A. C. Bovik, H. R. Sheikh, E. P. Simoncelli, Image quality assessment: from error visibility to structural similarity, IEEE Trans. Image Process. 13 (4) (2004) 600– 612

2004

[9] [10]

Z. Shen, J. Liu, Y. He, X. Zhang, R. Xu, H. Yu, P. Cui, Towards out-of-distribution generalization: A survey, arXiv preprint arXiv:2108.13624 (2021)

arXiv 2021

[10] [11]

K. Li, A. N. Rubungo, X. Lei, D. Persaud, K. Choudhary, B. DeCost, A. B. Dieng, J. Hattrick-Simpers, Probing out-of-distribution generalization in machine learning for materials, Commun. Mater. 6 (1) (2024)

2024

[11] [12]

Florez, K

S. Florez, K. Alvarado, D. P. Muñoz, M. Bernacki, A novel highly efficient lagrangian model for massively multidomain simulation applied to microstructural evolutions, Comput. Methods Appl. Mech. Eng. 367 (2020) 113107. 25

2020

[12] [13]

Florez, J

S. Florez, J. Fausty, K. Alvarado, B. Murgas, M. Bernacki, Parallelization of an efficient 2d-lagrangian model for massive multi-domain simulations, Modell. Simul. Mater. Sci. Eng. 29 (6) (2021) 065005

2021

[13] [14]

M. Bernacki, Kinetic equations and level-set approach for simulating solid-state mi- crostructure evolutions at the mesoscopic scale: State of the art, limitations, and prospects, Prog. Mater. Sci. 142 (2024) 101224

2024

[14] [15]

Hitti, P

K. Hitti, P. Laure, T. Coupez, L. Silva, M. Bernacki, Precise generation of complex statistical representative volume elements (rves) in a finite element context, Comput. Mater. Sci. 61 (2012) 224–238

2012

[15] [16]

Vaswani, N

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, I. Polosukhin, Attention is all you need, Adv. Neural Inf. Process. Syst. 30 (2017)

2017

[16] [17]

Bahdanau, K

D. Bahdanau, K. Cho, Y. Bengio, Neural machine translation by jointly learning to align and translate, arXiv preprint arXiv:1409.0473 (2015)

Pith/arXiv arXiv 2015

[17] [18]

C. E. Krill, E. A. Holm, J. M. Dake, R. Cohn, K. Holikova, F. Andorfer, Extreme abnormal grain growth: Connecting mechanisms to microstructural outcomes, Annu. Rev. Mater. Res. 53 (1) (2023) 319–345

2023

[18] [19]

Rollett, G

A. Rollett, G. S. Rohrer, J. Humphreys, Recrystallization and Related Annealing Phenomena, 3rd Edition, Elsevier, 2017

2017

[19] [20]

Kullback, R

S. Kullback, R. A. Leibler, On information and sufficiency, Ann. Math. Stat. 22 (1) (1951) 79–86

1951

[20] [21]

Montavon, K.-R

G. Montavon, K.-R. Müller, Learning with wasserstein distances, in: Adv. Neural Inf. Process. Syst., 2013, pp. 689–697

2013

[21] [22]

W. W. Mullins, Two-dimensional motion of idealized grain boundaries, J. Appl. Phys. 27 (8) (1956) 900–904. 26 (a) (b) (c) (d) (e) (f) 0.00 (background) 0.15 (boundary, low attention) 0.50 0.75 1.00 (max attention) Attention Score Figure12: Attentionweightheatmapsgeneratedbytheboundary-maskedattentionmech- anism at the first prediction step (t= 10 min) and...

1956