Mask-Morph Graph U-Net: A Generalisable Mesh-Based Surrogate for Crashworthiness Field Prediction under Large Geometric Variation
Pith reviewed 2026-05-19 16:34 UTC · model grok-4.3
pith:DHASQDOC Add to your LaTeX paper
What is a Pith Number?\usepackage{pith}
\pithnumber{DHASQDOC}
Prints a linked pith:DHASQDOC badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more
The pith
Mask-Morph Graph U-Net uses coarse-graph morphing and masked pretraining to generalise hierarchical GNNs to new mesh geometries in crash simulations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By morphing the coarsened graph hierarchy to each input mesh using feature-aligned barycentric parameterisation before constructing cross-graph edges, and applying node masking during supervised pretraining with parameter-efficient fine-tuning where high-parameter edge-specific layers are frozen, the Mask-Morph Graph U-Net retains the benefits of edge-specific layers while improving spatial correspondence and generalisability across varying graph structures for crash field prediction under large geometric variation.
What carries the argument
Coarse-graph morphing via feature-aligned barycentric parameterisation together with masked supervised pretraining inside a hierarchical Graph U-Net.
If this is right
- Coarse-graph morphing improves test accuracy relative to a fixed-coarse-graph baseline.
- Masked supervised pretraining reduces train-test discrepancy.
- Masked pretraining improves data efficiency in cross-component transfer settings.
- The model achieves lower prediction error than external baselines.
- The method supplies a practical route to reusable, data-efficient mesh-based surrogates for crashworthiness design exploration.
Where Pith is reading between the lines
- The same morphing step could be tested on other mesh-based physics problems that require transfer across shape families.
- Freezing the expensive layers after masked pretraining may lower the cost of maintaining a library of component-specific surrogates.
- If the correspondence errors remain small, the technique could be combined with gradient-based shape optimisation loops that query the surrogate thousands of times.
- Extending the masking strategy to include geometric features might further improve robustness when component topologies differ more radically.
Load-bearing premise
Feature-aligned barycentric parameterisation creates accurate enough spatial correspondence between the morphed coarse graph and each new input mesh without systematic interpolation errors that would propagate through the U-Net layers.
What would settle it
A controlled test in which the morphed-coarse-graph version produces higher mean Euclidean distance or maximum intrusion errors than the fixed-coarse-graph baseline on out-of-distribution meshes would falsify the accuracy claim.
Figures
read the original abstract
Nonlinear finite element crash simulations are accurate but computationally expensive, limiting their use in iterative design optimisation. Machine-learning surrogate models based on graph neural networks (GNNs) offer a faster alternative. Message-passing GNNs are widely used for mesh simulation, and their shared node and edge update functions are relatively generalisable across varying graph structures. By contrast, non-shareable edge-specific aggregation layers can capture nonlinear relationships more accurately but usually require fixed graph connectivity, which limits generalisability. This paper presents Mask-Morph Graph U-Net (MMGUNet), a practical approach to addressing the limitation of hierarchical Graph U-Net architectures that use edge-specific downsampling and upsampling layers. Fixed coarse graph connectivity is required for edge-specific layers. To retain this while improving spatial correspondence, the proposed method morphs the coarsened graph hierarchy to each input mesh using feature-aligned barycentric parameterisation before constructing cross-graph edges. It further applies node masking during supervised pretraining, followed by parameter-efficient fine-tuning in which high-parameter edge-specific layers are frozen. The proposed approach is evaluated in in-distribution, out-of-distribution, and cross-component transfer settings using mean Euclidean distance and maximum intrusion percentage error. Results show that coarse-graph morphing improves test accuracy relative to a fixed-coarse-graph baseline, while masked supervised pretraining reduces the train-test discrepancy and improves data efficiency during transfer. The proposed model also achieves lower prediction error compared with external baselines. These results demonstrate a practical route toward reusable, data-efficient mesh-based surrogate modelling for crashworthiness design exploration.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces Mask-Morph Graph U-Net (MMGUNet), a hierarchical GNN surrogate for predicting crashworthiness fields (e.g., displacement and intrusion) on meshes with large geometric variations. It addresses the fixed-connectivity limitation of edge-specific down/upsampling layers in Graph U-Nets by morphing a fixed coarse graph hierarchy to each input mesh via feature-aligned barycentric parameterisation, then constructing cross-graph edges. Supervised pretraining with node masking is followed by parameter-efficient fine-tuning that freezes high-parameter edge-specific layers. Evaluations across in-distribution, out-of-distribution, and cross-component transfer settings report lower mean Euclidean distance and maximum intrusion percentage errors than fixed-coarse-graph baselines and external methods, attributing gains to improved spatial correspondence from morphing and reduced train-test discrepancy from masking.
Significance. If the central claims hold after addressing the noted gaps, the work offers a concrete, practical route to reusable mesh surrogates for nonlinear crash simulations. It combines explicit morphing to retain edge-specific layers with masking-based pretraining for data efficiency, which could meaningfully reduce reliance on per-geometry retraining in iterative design loops. The multi-setting evaluation (in-dist/OOD/transfer) and comparison to baselines are strengths that, once supported by error quantification and ablations, would strengthen the case for generalisable GNN surrogates in engineering.
major comments (2)
- [§3.2] §3.2 (Morphing and Cross-Graph Construction): The accuracy of feature-aligned barycentric parameterisation is not quantified (no mean/max node displacement, Hausdorff distance, or per-region error relative to input mesh resolution). This is load-bearing for the strongest claim that morphing improves test accuracy via better correspondence; without these metrics it remains unclear whether observed gains over the fixed-coarse-graph baseline arise from reduced interpolation error or from other factors, and whether residual offsets propagate through the hierarchical message-passing layers.
- [§5] §5 (Experiments and Ablations): No ablation isolates the morphing step from the masking schedule, nor reports quantitative error bars or details on train-test splits and data exclusion criteria. Because the central claims rest on morphing improving accuracy and masking improving data efficiency in transfer, the absence of these controls makes it difficult to attribute performance differences specifically to the proposed components rather than to training schedule or dataset partitioning choices.
minor comments (2)
- [§3] Notation for the morphed coarse-graph nodes and the cross-graph edge construction could be made more explicit in the methods to avoid ambiguity when readers compare against the fixed-graph baseline.
- [Figures in §5] Figure captions and axis labels in the result plots would benefit from stating the exact error metric (mean Euclidean vs. max intrusion) and whether shaded regions represent standard deviation across runs.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed feedback. We address each major comment below and indicate where revisions will be made to strengthen the manuscript.
read point-by-point responses
-
Referee: [§3.2] §3.2 (Morphing and Cross-Graph Construction): The accuracy of feature-aligned barycentric parameterisation is not quantified (no mean/max node displacement, Hausdorff distance, or per-region error relative to input mesh resolution). This is load-bearing for the strongest claim that morphing improves test accuracy via better correspondence; without these metrics it remains unclear whether observed gains over the fixed-coarse-graph baseline arise from reduced interpolation error or from other factors, and whether residual offsets propagate through the hierarchical message-passing layers.
Authors: We agree that quantifying the morphing accuracy is necessary to substantiate the claim that improved spatial correspondence drives the observed gains. In the revised manuscript we will add explicit metrics for the feature-aligned barycentric parameterisation, including mean and maximum node displacements, Hausdorff distance, and per-region error relative to input mesh resolution. These will be reported in an expanded §3.2 and/or a new appendix, allowing readers to assess residual offsets and their potential propagation through the hierarchy. revision: yes
-
Referee: [§5] §5 (Experiments and Ablations): No ablation isolates the morphing step from the masking schedule, nor reports quantitative error bars or details on train-test splits and data exclusion criteria. Because the central claims rest on morphing improving accuracy and masking improving data efficiency in transfer, the absence of these controls makes it difficult to attribute performance differences specifically to the proposed components rather than to training schedule or dataset partitioning choices.
Authors: We concur that isolating the contributions of morphing and masking, together with statistical rigour, is required for clear attribution. The revised §5 will include (i) an ablation that decouples the morphing step from the masking pretraining schedule, (ii) quantitative error bars (standard deviation across multiple random seeds or runs), and (iii) expanded details on train-test split construction and any data exclusion criteria. These additions will directly address the concern about confounding factors. revision: yes
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The paper introduces explicit methodological components (feature-aligned barycentric morphing of coarse graphs and node masking in pretraining) whose definitions and implementations are independent of the final evaluation metrics (mean Euclidean distance and maximum intrusion percentage error). These steps are described as practical solutions to fixed-graph limitations in hierarchical GNNs, with empirical validation on held-out meshes in multiple settings. No equations, self-citations, or fitted parameters are presented as reducing the central claims to their own inputs by construction. The provided text contains no load-bearing self-references, ansatzes smuggled via citation, or renamings of known results as novel derivations.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Barycentric parameterisation produces a spatial correspondence that is sufficiently accurate for hierarchical message passing across large geometric variations.
Reference graph
Works this paper leans on
-
[1]
S. R. Wu, Convergence study on explicit finite element for crashwor- thiness analysis, Tech. Rep. 2006-01-0672, SAE International (2006). doi:10.4271/2006-01-0672
-
[2]
J. Chang, T. Tyan, M. El-Bkaily, J. Cheng, A. Marpu, Q. Zeng, J. Santini, Implicit and explicit finite element methods for crash safety analysis, Tech. Rep. 2007-01-0982, SAE International (2007). doi:10.4271/2007-01-0982
-
[3]
E. Albak, Optimization design for circular multi-cell thin-walled tubes with discrete and continuous design variables, Mechanics of Advanced Materials and Structures 30 (24) (2023) 5091–5105
work page 2023
- [4]
-
[5]
A. Ahmadi Dastjerdi, M. Moshref-Javadi, H. Ahmadian, M. Gholam- pour, Crushing analysis and multi-objective optimization of different length bi-thin walled cylindrical structures under axial impact loading, Engineering Optimization 51 (2019) 1–18
work page 2019
-
[6]
P. Zende, H. Dalir, Multi-objective optimization of composite square tube for minimizing peak crushing force and maximizing specific en- ergy absorption using artificial neural network and genetic algorithm, in: ASME International Mechanical Engineering Congress and Exposi- tion, Proceedings (IMECE), 2022
work page 2022
- [7]
-
[8]
E. Sakaridis, N. Karathanasopoulos, D. Mohr, Machine-learning based prediction of crash response of tubular structures, International Journal of Impact Engineering 166 (2022). 37
work page 2022
-
[9]
C. P. Kohar, T. K. Eller, D. S. Connolly, K. Inal, Using artificial intel- ligence to aid vehicle lightweighting in crashworthiness with aluminum, MATEC Web of Conferences 326 (2020) 01006
work page 2020
-
[10]
W. Guo, P. Xu, C. Yang, J. Guo, L. Yang, S. Yao, Machine learning- basedcrashworthinessoptimizationforthesquareconeenergy-absorbing structure of the subway vehicle, Structural and Multidisciplinary Opti- mization 66 (8) (2023) 182
work page 2023
-
[11]
C. P. Kohar, L. Greve, T. K. Eller, D. S. Connolly, K. Inal, A machine learning framework for accelerating the design process using cae simula- tions: An application to finite element analysis in structural crashwor- thiness, Computer Methods in Applied Mechanics and Engineering 385 (2021) 114008. doi:10.1016/j.cma.2021.114008
-
[12]
H. Li, H. Zhou, N. Li, An integrated convolutional neural network- based surrogate model for crashworthiness performance prediction of hot-stamped vehicle panel components, in: MATEC Web of Confer- ences, Vol. 401, 2024, p. 03013. doi:10.1051/matecconf/202440103013
-
[13]
H. Li, Y. Zhao, H. Zhou, T. Pfaff, N. Li, A graph neural network surrogate model for mesh-based crashworthiness predic- tion of vehicle panel components, Results in Engineering (2026). doi:10.1016/j.rineng.2026.110925
-
[14]
Z. Wen, Y. Li, H. Wang, Y. Peng, Data-driven spatiotemporal modeling for structural dynamics on irregular domains by stochastic dependency neural estimation, Computer Methods in Applied Mechanics and Engi- neering 404 (2023) 115831. doi:10.1016/j.cma.2022.115831
-
[15]
Y. Le Guennec, T. Defoort, J. V. Aguado, D. Borzacchiello, Compar- ing traditional surrogate modelling and neural fields for vehicle crash simulation data, in: SIA Simulation numérique 2025, 2025
work page 2025
- [16]
-
[17]
S. Thel, L. Greve, B. van de Weg, P. van der Smagt, Introducing fi- nite element method integrated networks (femin), Computer Methods in Applied Mechanics and Engineering 427 (2024) 117073. 38
work page 2024
-
[18]
S. Thel, L. Greve, M. Karl, P. van der Smagt, Accelerating crash sim- ulations with finite element method integrated networks (femin): Com- paring two approaches to replace large portions of a fem simulation, Computer Methods in Applied Mechanics and Engineering 443 (2025) 118046
work page 2025
-
[19]
G. Zhang, Y. Liu, Y. Quan, J. Yan, A mesh-based geometric deep learning framework for rapid response prediction of large-scale and multi-component mechanical structures in engineering, Computer Methods in Applied Mechanics and Engineering 448 (2026) 118435. doi:10.1016/j.cma.2025.118435
-
[20]
M. A. Nabian, S. Chavare, D. Akhare, R. Ranade, R. Cherukuri, S. Tadepalli, Automotive crash dynamics modeling accelerated with machine learning, in: SAE Technical Paper, 2025, pp. 2026–01–0568. doi:10.4271/2026-01-0568
- [21]
-
[22]
H. Wu, H. Luo, H. Wang, J. Wang, M. Long, Transolver: A fast transformer solver for pdes on general geometries, arXiv preprint arXiv:2402.02366 (2024)
work page internal anchor Pith review Pith/arXiv arXiv 2024
- [23]
-
[24]
PhysicsNeMoContributors, Nvidiaphysicsnemo: Anopen-sourceframe- work for physics-based deep learning in science and engineering, released February 24, 2023. GitHub (2023)
work page 2023
-
[25]
S. Deshpande, J. Lengiewicz, S. Bordas, MAgNET: A graph U-Net ar- chitecture for mesh-based simulations, arXiv preprint arXiv:2211.00713 (2022)
-
[26]
J. He, Y. Jiao, S. P. A. Bordas, On the use of graph neural networks and shape-function-based gradient computation in the deep energy method, International Journal for Numerical Methods in Engineering 124 (4) (2023) 864–879. 39
work page 2023
-
[27]
X. Fu, J. Hu, X. Zhang, H. Feng, An finite element analysis surrogate model with boundary oriented graph embedding approach for rapid de- sign, Journal of Computational Design and Engineering 10 (3) (2023) 1026–1046
work page 2023
-
[28]
Q. Chen, X. Wen, Y. Zhang, Predicting dynamic responses of contin- uous deformable bodies: A graph-based learning approach, Computer Methods in Applied Mechanics and Engineering 420 (2024) 116669
work page 2024
-
[29]
H. Zhou, Y. Zhao, H. Li, T. Pfaff, N. Li, A multi-level graph- based surrogate model for real-time high-fidelity sheet forming simulations, Advanced Engineering Informatics 66 (2025) 103458. doi:10.1016/j.aei.2025.103458
-
[30]
A. Sanchez-Gonzalez, J. Godwin, T. Pfaff, R. Ying, J. Leskovec, P. Battaglia, Learning to simulate complex physics with graph networks, in: International Conference on Machine Learning (ICML), 2020
work page 2020
-
[31]
M. Fortunato, T. Pfaff, K. Stachenfeld, P. Battaglia, Multiscale mesh- graphnets, in: 2nd AI4Science Workshop at the 39th International Con- ference on Machine Learning (ICML), 2022
work page 2022
- [32]
-
[33]
Y. Zhao, Q. Chen, H. Li, H. Zhou, H. R. Attar, T. Pfaff, T. Wu, N. Li, Recurrent U-Net-based graph neural network (RUGNN) for accurate de- formation predictions in sheet material forming, Advanced Engineering Informatics 69 (2026) 104021. doi:10.1016/j.aei.2025.104021
-
[34]
A. de Boer, M. S. van der Schoot, H. Bijl, Mesh deformation based on radial basis function interpolation, Computers & Structures 85 (11–14) (2007) 784–795. doi:10.1016/j.compstruc.2007.01.013
-
[35]
T. W. Sederberg, S. R. Parry, Free-form deformation of solid geometric models, in: Proceedings of the 13th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 1986, pp. 151–160. doi:10.1145/15922.15903. 40
-
[36]
V. Kraevoy, A. Sheffer, Cross-parameterization and compatible remesh- ing of 3d models, ACM Transactions on Graphics 23 (3) (2004) 861–869. doi:10.1145/1015706.1015811
-
[37]
J. Schreiner, A. Asirvatham, E. Praun, H. Hoppe, Inter-surface mapping, ACM Transactions on Graphics 23 (3) (2004) 870–877. doi:10.1145/1015706.1015812
-
[38]
M. Alexa, D. Cohen-Or, D. Levin, As-rigid-as-possible shape interpola- tion, in: Proceedings of EUROGRAPHICS 2000, Blackwell Publishers Ltd, 2000, pp. 157–164. doi:10.1111/1467-8659.00339
-
[39]
W. T. Tutte, How to draw a graph, Proceedings of the London Mathe- matical Society s3-13 (1) (1963) 743–767. doi:10.1112/plms/s3-13.1.743
-
[40]
M. S. Floater, Parametrization and smooth approximation of surface triangulations, Computer Aided Geometric Design 14 (3) (1997) 231–
work page 1997
-
[41]
doi:10.1016/S0167-8396(96)00031-3
-
[42]
M. S. Floater, Mean value coordinates, Computer Aided Geometric De- sign 20 (1) (2003) 19–27. doi:10.1016/S0167-8396(03)00002-5
-
[43]
B. Lévy, S. Petitjean, N. Ray, J. Maillot, Least squares conformal maps for automatic texture atlas generation, in: Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2002, pp. 362–371. doi:10.1145/566654.566590
-
[44]
M. Rabinovich, R. Poranne, D. Panozzo, O. Sorkine-Hornung, Scalable locally injective mappings, ACM Transactions on Graphics 36 (2) (2017) 16:1–16:16. doi:10.1145/2977606
-
[45]
K. He, X. Chen, S. Xie, Y. Li, P. Dollár, R. Girshick, Masked autoen- coders are scalable vision learners, in: Proceedings of the IEEE/CVF ConferenceonComputerVisionandPatternRecognition(CVPR),2022, pp. 16000–16009
work page 2022
-
[46]
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, Bert: Pre-training of deep bidirectional transformers for language understanding, arXiv preprint (2019). arXiv:1810.04805. 41
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[47]
Z.Hou, X.Liu, Y.Cen, Y.Dong, H.Yang, C.Wang, J.Tang, Graphmae: Self-supervised masked graph autoencoders, in: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2022, pp. 594–604
work page 2022
-
[48]
Z. Hou, X. Liu, Y. Cen, Y. Zhao, Y. Dong, H. Yang, C. Wang, J. Tang, Graphmae2: A decoding-enhanced masked self-supervised graph learner, in: Proceedings of the ACM Web Conference (WWW), 2023, pp. 737–746
work page 2023
-
[49]
P. Garnier, V. Lannelongue, J. Viquerat, E. Hachem, Meshmask: Pre- training mesh simulators with masked autoencoding, arXiv preprint (2025). arXiv:2502.10841
- [50]
- [51]
-
[52]
M. D. McKay, R. J. Beckman, W. J. Conover, A comparison of three methods for selecting values of input variables in the analysis of output from a computer code, Technometrics 21 (2) (1979) 239–245
work page 1979
-
[53]
T. Lehrer, P. Stocker, F. Duddeck, M. Wagner, Ucsm: Dataset of u-shaped parametric cad geometries and real-world sheet metal meshes for deep drawing, Computer-Aided Design 188 (2025) 103924. doi:10.1016/j.cad.2025.103924
- [54]
-
[55]
P. Garnier, J. Viquerat, E. Hachem, Multi-grid graph neural networks with self-attention for computational mechanics (2024). arXiv:2409.11899. URLhttps://arxiv.org/abs/2409.11899 42 Appendix A. Dataset parameters In this appendix, we provide the detailed parameter definitions used to generate the B-pillar and U-channel datasets. For the B-pillar case, as ...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.