Neural surrogates for crystal growth dynamics with variable supersaturation: explicit vs. implicit conditioning
Pith reviewed 2026-05-09 21:45 UTC · model grok-4.3
The pith
Explicit conditioning on supersaturation produces higher-fidelity neural surrogates for crystal growth than implicit inference from short input sequences.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Convolutional recurrent networks can act as accurate surrogates for Allen-Cahn crystal-growth dynamics when supersaturation is supplied explicitly as an input parameter, reproducing both global growth rates and local faceted morphology with high fidelity; the same networks can also infer the parameter from a short input sequence, but require substantially more training data to reach comparable accuracy.
What carries the argument
Convolutional recurrent neural networks that either accept supersaturation as an explicit scalar input or infer it implicitly by processing a short input mini-sequence before predicting the continuation.
If this is right
- Explicit conditioning reproduces ground-truth crystal profiles with high fidelity across the tested supersaturation range even when the training set is small.
- Implicit mini-sequence conditioning reaches comparable accuracy only after the training dataset is enlarged.
- Trained models remain accurate on spatial domains 256 times larger than those used in training.
- Prediction error stays limited when the generated sequences are extended more than ten times beyond the training length.
- Both architectures consistently capture the global effect of supersaturation on growth speed and its local effect on facet formation.
Where Pith is reading between the lines
- The same explicit-conditioning strategy could be applied to other phase-field models to create fast surrogates for different material systems or driving forces.
- Hybrid networks that combine a short implicit check with explicit parameter input might remain robust when the true supersaturation is only known approximately.
- Because error accumulates slowly, the surrogates could support long-time optimization loops in which supersaturation is varied to achieve desired crystal shapes.
Load-bearing premise
The numerical Allen-Cahn solutions with kinetic anisotropy form a representative, noise-free distribution that lets the networks learn dynamics capable of generalizing to new supersaturation values, domain sizes, and sequence lengths.
What would settle it
Running a fresh numerical integration of the Allen-Cahn equation at a supersaturation value held out from training and finding that the explicit model's predicted interface positions deviate by more than a few percent from that reference solution.
Figures
read the original abstract
Simulations of crystal growth are performed by using Convolutional Recurrent Neural Network surrogate models, trained on a dataset of time sequences computed by numerical integration of Allen-Cahn dynamics including faceting via kinetic anisotropy. Two network architectures are developed to take into account the effects of a variable supersaturation value. The first infers it implicitly by processing an input mini-sequence of a few evolution frames and then returns a consistent continuation of the evolution. The second takes the supersaturation parameter as an explicit input along with a single initial frame and predicts the entire sequence. The two models are systematically tested to establish strengths and weaknesses, comparing the prediction performance for models trained on datasets of different size and, in the first architecture, different lengths of input mini-sequence. The analysis of point-wise and mean absolute errors shows how the explicit parameter conditioning guarantees the best results, reproducing with high-fidelity the ground-truth profiles. Comparable results are achievable by the mini-sequence approach only when using larger training datasets. The trained models show strong conditioning by the supersaturation parameter, consistently reproducing its overall impact on growth rates as well as its local effect on the faceted morphology. Moreover, they are perfectly scalable even on 256 times larger domains and can be successfully extended to more than 10 times longer sequences with limited error accumulation. The analysis highlights the potential and limits of these approaches in view of their general exploitation for crystal growth simulations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops two convolutional recurrent neural network (CRNN) surrogate models for crystal growth dynamics governed by the Allen-Cahn equation with kinetic anisotropy and variable supersaturation. One architecture conditions implicitly by ingesting a short input mini-sequence of frames before predicting continuation; the other conditions explicitly by receiving the supersaturation value together with a single initial frame and predicting the full sequence. Both are trained on numerical simulation trajectories and compared via point-wise and mean-absolute-error metrics across training-set sizes, with additional tests of scalability to 256-fold larger domains and >10-fold longer sequences.
Significance. If the reported error reductions and scalability hold under rigorous out-of-distribution testing, the explicit-conditioning architecture would supply a practical, high-fidelity surrogate that captures both global growth-rate dependence and local faceting morphology, offering substantial computational savings for materials-science simulations that must explore many supersaturation values.
major comments (3)
- [Results (error analysis and dataset-size experiments)] The central claim that explicit conditioning 'guarantees the best results' and reproduces ground-truth profiles with high fidelity rests on error comparisons across dataset sizes, yet the manuscript supplies no information on the discrete or continuous sampling of supersaturation values, the precise train/test partition with respect to those parameter values, or quantitative out-of-distribution metrics. Without these details the observed advantage could be in-distribution interpolation rather than true generalization.
- [Scalability and long-sequence extension tests] The scalability assertions—perfect extrapolation to 256× domain size and >10× sequence length with only limited error accumulation—are presented without tabulated MAE values, error-growth curves, or direct comparison against the implicit model on the same extrapolated regimes. These numbers are load-bearing for the claim that the surrogates are 'perfectly scalable'.
- [Methods (dataset generation) and Discussion] The weakest-assumption concern is not addressed: the training trajectories are generated from a finite set of Allen-Cahn solutions with kinetic anisotropy; if supersaturation values are sparsely or correlatedly sampled, low in-distribution MAE does not guarantee correct local faceting response or growth-rate scaling for truly unseen supersaturations. No ablation on supersaturation density or morphology diversity is reported.
minor comments (2)
- [Model architectures] Notation for the two architectures is introduced clearly in the abstract but would benefit from a short table in the main text that lists input dimensionality, conditioning mechanism, and output horizon for each model.
- [Figures] Figure captions for error heat-maps or morphology snapshots should explicitly state the supersaturation value(s) shown and whether they lie inside or outside the training range.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments, which help clarify the presentation of our results. We address each major comment below and indicate the revisions that will be incorporated into the next version of the manuscript.
read point-by-point responses
-
Referee: [Results (error analysis and dataset-size experiments)] The central claim that explicit conditioning 'guarantees the best results' and reproduces ground-truth profiles with high fidelity rests on error comparisons across dataset sizes, yet the manuscript supplies no information on the discrete or continuous sampling of supersaturation values, the precise train/test partition with respect to those parameter values, or quantitative out-of-distribution metrics. Without these details the observed advantage could be in-distribution interpolation rather than true generalization.
Authors: We agree that these details are necessary to substantiate the generalization claim. The revised manuscript will include a complete description of the supersaturation sampling procedure used to generate the training trajectories, the exact train/test partitioning with respect to supersaturation values, and quantitative out-of-distribution error metrics for supersaturation values held out from training. These additions will demonstrate that the explicit-conditioning advantage is not limited to in-distribution interpolation. revision: yes
-
Referee: [Scalability and long-sequence extension tests] The scalability assertions—perfect extrapolation to 256× domain size and >10× sequence length with only limited error accumulation—are presented without tabulated MAE values, error-growth curves, or direct comparison against the implicit model on the same extrapolated regimes. These numbers are load-bearing for the claim that the surrogates are 'perfectly scalable'.
Authors: We acknowledge that the current presentation relies on qualitative statements. The revised manuscript will add tabulated MAE values for the 256-fold domain scaling and >10-fold sequence extension experiments, together with error-growth curves over time. Direct comparisons between the explicit and implicit models on these extrapolated regimes will also be included to quantify the limited error accumulation. revision: yes
-
Referee: [Methods (dataset generation) and Discussion] The weakest-assumption concern is not addressed: the training trajectories are generated from a finite set of Allen-Cahn solutions with kinetic anisotropy; if supersaturation values are sparsely or correlatedly sampled, low in-distribution MAE does not guarantee correct local faceting response or growth-rate scaling for truly unseen supersaturations. No ablation on supersaturation density or morphology diversity is reported.
Authors: This concern is valid and will be addressed. The revised Methods section will expand on the dataset generation process, and we will add an ablation study that varies supersaturation sampling density while measuring both in-distribution and out-of-distribution performance on faceting morphology and growth-rate scaling. The Discussion will be updated to interpret these results in light of the finite training set. revision: yes
Circularity Check
No circularity: purely empirical ML surrogate evaluation on independent simulations
full rationale
The paper trains and evaluates convolutional recurrent neural networks on time sequences generated by numerical integration of the Allen-Cahn equation with kinetic anisotropy. Explicit vs. implicit supersaturation conditioning is compared via point-wise and mean absolute errors on held-out ground-truth trajectories, with additional tests for scalability to larger domains and longer sequences. No load-bearing steps reduce to self-definition, fitted inputs renamed as predictions, or self-citation chains; results follow directly from standard train/test splits on external simulation data without any derivation that equates outputs to inputs by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Allen-Cahn dynamics with kinetic anisotropy accurately capture the essential physics of faceted crystal growth under variable supersaturation
Reference graph
Works this paper leans on
-
[1]
J. Wei, X. Chu, X.-Y. Sun, K. Xu, H.-X. Deng, J. Chen, Z. Wei, and M. Lei, Machine learning in materials science, InfoMat1, 338 (2019)
work page 2019
-
[2]
K. Choudhary, B. DeCost, C. Chen, A. Jain, F. Tavazza, R. Cohn, C. W. Park, A. Choudhary, A. Agrawal, S. J. Billinge,et al., Recent advances and applications of deep learning methods in materials science, npj Computa- tional Materials8, 59 (2022)
work page 2022
-
[3]
J. Schmidt, M. R. Marques, S. Botti, and M. A. Marques, Recent advances and applications of machine learning in solid-state materials science, npj computational materi- als5, 83 (2019)
work page 2019
-
[4]
I. Peivaste, S. Belouettar, F. Mercuri, N. Fantuzzi, H. Dehghani, R. Izadi, H. Ibrahim, J. Lengiewicz, M. Belouettar-Mathis, K. Bendine,et al., Artificial in- telligence in materials science and engineering: Current landscape, key challenges, and future trajectories, Com- posite Structures372, 119419 (2025)
work page 2025
-
[5]
P. Lyngby and K. S. Thygesen, Data-driven discovery of 2D materials by deep generative models, npj Computa- tional Materials8, 232 (2022)
work page 2022
-
[6]
Y. Zhao, E. M. D. Siriwardane, Z. Wu, N. Fu, M. Al- Fahdi, M. Hu, and J. Hu, Physics guided deep learning for generative design of crystal materials with symmetry constraints, npj Computational Materials9, 38 (2023)
work page 2023
-
[7]
E. T. Chenebuah, M. Nganbe, and A. B. Tchagang, A deep generative modeling architecture for designing lattice-constrained perovskite materials, npj Computa- tional Materials10, 198 (2024)
work page 2024
-
[8]
C. Karpovich, E. Pan, and E. A. Olivetti, Deep reinforce- ment learning for inverse inorganic materials design, npj Computational Materials10, 287 (2024)
work page 2024
-
[9]
N. Provatas and K. Elder,Phase-Field Methods in Ma- terials Science and Engineering, 1st ed. (Wiley, 2010)
work page 2010
-
[10]
B. Li, J. Lowengrub, A. Ratz, and A. Voigt, Geomet- ric Evolution Laws for Thin Crystalline Films: Mod- eling and Numerics, Communications in Computational Physics6, 433 (2009)
work page 2009
-
[11]
D. Montes De Oca Zapiain, J. A. Stewart, and R. Dingre- ville, Accelerating phase-field-based microstructure evo- lution predictions via surrogate models trained by ma- chine learning methods, npj Computational Materials7, 3 (2021)
work page 2021
-
[12]
C. Hu, S. Martin, and R. Dingreville, Accelerating phase- field predictions via recurrent neural networks learning the microstructure evolution in latent space, Computer Methods in Applied Mechanics and Engineering397, 115128 (2022)
work page 2022
- [13]
- [14]
-
[15]
K. Alhada–Lahbabi, D. Deleruyelle, and B. Gautier, Ma- chine learning surrogate for 3D phase-field modeling of ferroelectric tip-induced electrical switching, npj Com- putational Materials10, 197 (2024)
work page 2024
-
[16]
C. Bonneville, N. Bieberdorf, P. Robbe, M. Asta, H. Najm, L. Capolungo, and C. Safta, Towards spatio- temporal extrapolation of phase-field simulations with convolution-only neural networks (2026), 2601.04510
-
[17]
I. Peivaste, A. Makradi, and S. Belouettar, Teaching ar- tificial intelligence to perform rapid, resolution-invariant grain growth modeling via fourier neural operator, Com- puter Methods in Applied Mechanics and Engineering 440, 117945 (2025)
work page 2025
-
[18]
S. Fan, A. L. Hitt, M. Tang, B. Sadigh, and F. Zhou, Ac- celerate microstructure evolution simulation using graph neural networks with adaptive spatiotemporal resolution, Machine Learning: Science and Technology5, 025027 (2024)
work page 2024
-
[19]
D. Lanzoni, F. Montalenti, and R. Bergamaschini, Deep learning for simulating the evolution of condensed matter systems at the continuum scale: Methods and applica- tions, Journal of Physics: Condensed Matter37, 403003 (2025)
work page 2025
-
[20]
X. Shi, Z. Chen, H. Wang, D.-Y. Yeung, W.-k. Wong, and W.-c. Woo, Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting (2015), 1506.04214
work page Pith review arXiv 2015
-
[21]
Delving Deeper into Convolutional Networks for Learning Video Representations
N. Ballas, L. Yao, C. Pal, and A. Courville, Delving Deeper into Convolutional Networks for Learning Video Representations (2016), arXiv:1511.06432
work page Pith review arXiv 2016
-
[22]
K. Yang, Y. Cao, Y. Zhang, S. Fan, M. Tang, D. Aberg, B. Sadigh, and F. Zhou, Self-supervised learning and pre- diction of microstructure evolution with convolutional re- current neural networks, Patterns2, 100243 (2021)
work page 2021
-
[23]
P. Wu, A. S. Iquebal, and K. Ankit, Emulating mi- crostructural evolution during spinodal decomposition using a tensor decomposed convolutional and recurrent neural network, Computational Materials Science224, 112187 (2023)
work page 2023
-
[24]
D. Lanzoni, A. Fantasia, R. Bergamaschini, O. Pierre- Louis, and F. Montalenti, Extreme time extrapolation capabilities and thermodynamic consistency of physics- inspired neural networks for the 3D microstructure evolu- tion of materials via Cahn–Hilliard flow, Machine Learn- ing: Science and Technology5, 045017 (2024)
work page 2024
-
[25]
A. Fantasia, D. Lanzoni, N. Di Eugenio, A. Monteleone, R. Bergamaschini, and F. Montalenti, A parametrically- Conditioned Deep Learning Surrogate for Coherent Spin- odal Decomposition, Advanced Theory and Simulations 9, e02144 (2026)
work page 2026
-
[26]
A. A. K. Farizhandi, O. Betancourt, and M. Mamivand, Deep learning approach for chemistry and processing his- tory prediction from materials microstructure, Scientific Reports12, 4552 (2022)
work page 2022
-
[27]
D. Lanzoni, M. Albani, R. Bergamaschini, and F. Mon- talenti, Morphological evolution via surface diffusion learned by convolutional, recurrent neural networks: Ex- trapolation and prediction uncertainty, Physical Review Materials6, 103801 (2022)
work page 2022
-
[28]
K. Kutsukake, Review of machine learning applications for crystal growth research, Journal of Crystal Growth 630, 127598 (2024)
work page 2024
-
[29]
M. Petkovic, L. Vieira, and N. Dropka, Machine learn- ing in crystal growth: A review of methods, data, and applications, Progress in Crystal Growth and Character- ization of Materials71, 100689 (2025). 13
work page 2025
-
[30]
M. Lu, S. Rao, H. Yue, J. Han, and J. Wang, Recent advances in the application of machine learning to crys- tal behavior and crystallization process control, Crystal Growth & Design24, 5374 (2024)
work page 2024
-
[31]
Y. Lu, X. Yang, B. Niu, T. Li, Z. Zheng, L. Jia, D. Chen, H. Qi, K. H. L. Zhang, M. Zhu, H. Zhang, and X. Lu, Achievement of high-quality gallium oxide epitaxial growth via machine learning, Advanced Func- tional Materials36, e19854 (2026)
work page 2026
-
[32]
S. M. Allen and J. W. Cahn, A microscopic theory for an- tiphase boundary motion and its application to antiphase domain coarsening, Acta Metallurgica27, 1085 (1979)
work page 1979
-
[33]
D. P. Kingma and J. Ba, Adam: A Method for Stochastic Optimization (2017), arXiv:1412.6980
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[34]
Y. Bengio, J. Louradour, R. Collobert, and J. Weston, Curriculum learning, inProceedings of the 26th Annual International Conference on Machine Learning, ICML ’09 (2009) pp. 41–48. SUPPLEMENTARY MATERIAL Neural surrogates for crystal growth dynamics with variable supersaturation: explicit vs. implicit conditioning Matteo Rigoni, 1 Daniele Lanzoni, 1, 2 Fr...
work page 2009
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.