Recognition: 1 theorem link
· Lean TheoremA Differentiable Framework for Gradient Enhanced Damage with Physics-Augmented Neural Networks in JAX-FEM
Pith reviewed 2026-05-13 18:05 UTC · model grok-4.3
The pith
Input-convex neural networks combined with gradient-enhanced damage enable mesh-independent simulations of soft material degradation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The elastic strain energy and damage yield function are each represented by input-convex neural networks that guarantee polyconvexity and thermodynamic consistency by construction; these networks are embedded in a gradient-enhanced damage model whose non-local field is governed by an auxiliary PDE, yielding mesh-independent finite-element solutions for strain-softening in soft materials.
What carries the argument
Input-convex neural networks (ICNNs) that enforce polyconvexity of the strain-energy function and satisfaction of the Clausius-Duhem inequality by design, together with the non-local damage variable introduced through an auxiliary gradient-enhanced partial differential equation.
If this is right
- Damage simulations become possible with constitutive models learned directly from experimental data while remaining thermodynamically consistent.
- Mesh sensitivity disappears in strain-softening problems once the non-local damage PDE is solved alongside the mechanical equilibrium equations.
- The fully differentiable implementation supports automatic differentiation for parameter calibration and design optimization tasks.
- Open-source release of the JAX-FEM code reduces the effort required to apply physics-augmented neural networks to new soft-material problems.
Where Pith is reading between the lines
- The same architecture could be extended to couple damage with viscoelasticity or growth by adding further network outputs and auxiliary fields.
- Because the framework is differentiable, material parameters inside the networks could be optimized directly against full-field experimental measurements.
- Transfer of the trained networks across different soft materials may be feasible if the ICNN structure preserves the required convexity properties under modest retraining.
Load-bearing premise
The input-convex neural networks trained on limited data will continue to enforce polyconvexity and the Clausius-Duhem inequality for all unseen loading paths, and the added non-local PDE will fully remove mesh dependence in three-dimensional heterogeneous geometries.
What would settle it
Observation of continued mesh-dependent damage localization in a three-dimensional heterogeneous notched specimen, or violation of thermodynamic consistency in the stress response along a loading path outside the training set.
Figures
read the original abstract
Soft materials such as rubbers, hydrogels, and biological tissues undergo damage in the form of stiffness degradation without apparent changes in their stress-free geometry. Accurate simulation of this behavior is critical in applications ranging from soft robotics to the design of medical devices, yet two persistent challenges are the difficulty of constructing flexible, thermodynamically consistent constitutive models, and the mesh dependence of finite element solutions caused by strain softening. Here we address both challenges simultaneously by combining physics-augmented neural network constitutive models with a gradient-enhanced damage formulation implemented within the differentiable finite element framework JAX-FEM. The elastic strain energy and the damage yield function are each parameterized by input-convex neural networks (ICNNs), which enforce polyconvexity and satisfaction of the Clausius--Duhem inequality by design. The gradient-enhanced formulation introduces a non-local damage field governed by an additional partial differential equation, regularizing the spatial distribution of damage and eliminating mesh dependence. The implementation is validated through local stress-strain fits, single-element parametric studies, a mesh and solution strategy study for a uniform deformation case, and a notched plate simulation. The results demonstrate that the proposed framework enables flexible, data-driven, mesh-independent damage simulation for a broad class of soft materials. We anticipate that the open-source implementation will lower the barrier to adopting physics-augmented neural network constitutive models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a differentiable framework that parameterizes the elastic strain-energy density and damage yield function using input-convex neural networks (ICNNs) within a gradient-enhanced damage formulation, all implemented in the JAX-FEM code. The ICNN architecture is intended to enforce polyconvexity and thermodynamic consistency (Clausius-Duhem) by construction, while the non-local damage PDE regularizes the problem to remove mesh dependence. Validation consists of local stress-strain fits, single-element parametric studies, a mesh/solution-strategy study on uniform deformation, and a notched-plate example; the central claim is that the approach enables flexible, data-driven, mesh-independent damage simulation for soft materials.
Significance. If the validation evidence were quantitative and the out-of-distribution consistency were demonstrated, the work would offer a useful bridge between data-driven constitutive modeling and physics-constrained finite-element simulation. The open-source JAX-FEM implementation and the simultaneous treatment of constitutive flexibility and numerical regularization would be genuine strengths for the computational mechanics community working on soft materials.
major comments (2)
- [Abstract / validation studies] Abstract and validation description: the manuscript states that validation was performed via stress-strain fits, parametric studies, mesh studies, and a notched-plate example, yet reports no quantitative error metrics, convergence rates, or comparisons against analytical solutions. Without these, the claims of accuracy and mesh independence remain unsubstantiated and constitute a load-bearing gap for the central contribution.
- [ICNN constitutive model section] ICNN parameterization and consistency enforcement: the architecture guarantees polyconvexity and Clausius-Duhem satisfaction only inside the convex hull of the (limited) training data. No systematic verification is provided that the learned functions continue to satisfy the required inequalities on out-of-distribution paths that arise once the non-local damage PDE couples fields across a heterogeneous 3-D mesh.
minor comments (2)
- [Abstract] The abstract would be strengthened by indicating the volume and type of training data used for the ICNNs and the specific soft-material classes considered.
- [Governing equations] Notation for the non-local damage variable and its coupling to the local damage field should be made fully explicit in the governing equations to aid reproducibility.
Simulated Author's Rebuttal
We thank the referee for the constructive comments, which help clarify the validation requirements and consistency aspects of our framework. We address each major comment below.
read point-by-point responses
-
Referee: [Abstract / validation studies] Abstract and validation description: the manuscript states that validation was performed via stress-strain fits, parametric studies, mesh studies, and a notched-plate example, yet reports no quantitative error metrics, convergence rates, or comparisons against analytical solutions. Without these, the claims of accuracy and mesh independence remain unsubstantiated and constitute a load-bearing gap for the central contribution.
Authors: We agree that quantitative metrics would strengthen the substantiation of accuracy and mesh independence. In the revised manuscript we will add explicit error metrics, including relative L2 errors between predicted and reference stress-strain curves for the local fits, convergence rates (error versus element size) for the uniform deformation mesh study, and direct comparisons to analytical solutions for the single-element parametric studies. For the notched-plate example we will report quantitative indicators of mesh independence such as variation in peak reaction force and damage-zone width across successively refined meshes. revision: yes
-
Referee: [ICNN constitutive model section] ICNN parameterization and consistency enforcement: the architecture guarantees polyconvexity and Clausius-Duhem satisfaction only inside the convex hull of the (limited) training data. No systematic verification is provided that the learned functions continue to satisfy the required inequalities on out-of-distribution paths that arise once the non-local damage PDE couples fields across a heterogeneous 3-D mesh.
Authors: The ICNN architecture enforces polyconvexity and Clausius-Duhem satisfaction globally by construction (non-negative weights and convex activations) for all inputs, independent of the training-data convex hull; the training data only governs approximation fidelity inside the sampled region. We acknowledge that no explicit out-of-distribution verification was included for paths arising in the coupled 3-D simulations. In the revision we will add a dedicated verification subsection that extracts representative strain histories from the notched-plate simulation (including paths outside the original training hull) and confirms that the required inequalities continue to hold. revision: partial
Circularity Check
No significant circularity; modeling choices and architectural constraints are independent of fitted outputs
full rationale
The paper describes an implementation framework that parameterizes strain energy and yield functions via input-convex neural networks chosen specifically to enforce polyconvexity and the Clausius-Duhem inequality by architecture, then couples this to a standard gradient-enhanced non-local damage PDE inside JAX-FEM. These are explicit design decisions rather than derivations in which a claimed prediction reduces to a fitted parameter or self-citation by construction. No equation equates an output quantity directly to an input fit; validation proceeds through separate local fits, single-element tests, and mesh studies whose success is not presupposed by the constitutive choice itself. The central result (mesh-independent data-driven damage simulation) therefore rests on independent numerical evidence and does not collapse to its own inputs.
Axiom & Free-Parameter Ledger
free parameters (1)
- ICNN weights for strain-energy and yield functions
axioms (2)
- domain assumption Input-convex neural networks enforce polyconvexity and satisfaction of the Clausius-Duhem inequality by design
- domain assumption The gradient-enhanced damage PDE regularizes the spatial distribution of damage and eliminates mesh dependence
Reference graph
Works this paper leans on
-
[1]
A Physics-Augmented Machine LearningConstitutiveModelforDamageinSolids. URL:http://arxiv. org/abs/2508.05638, doi:10.48550/arXiv.2508.05638. arXiv:2508.05638 [physics]. Amos, B., Xu, L., Kolter, J.Z.,
-
[2]
Computer Methods in Applied Mechanics and Engineering 405, 115867
Automated discovery of generalized standard material models with euclid. Computer Methods in Applied Mechanics and Engineering 405, 115867. Fuhg, J.N., Anantha Padmanabha, G., Bouklas, N., Bahmani, B., Sun, W., Vlassis,N.N.,Flaschel,M.,Carrara,P.,DeLorenzis,L.,2025. Areview on data-driven constitutive laws for solids: Jn fuhg et al. Archives of Computatio...
work page 2025
-
[3]
Finite Elements in Analysis and Design doi:10.1016/J.FINEL.2025.104358
A critical comparison of gradient and integral nonlocal damage models: Formulation, numerical predictions and computational aspects. Finite Elements in Analysis and Design doi:10.1016/J.FINEL.2025.104358. He, Y., Zuo, D., Hackl, K., Yang, H., Mousavi, S.J., Avril, S.,
-
[4]
Aphysics-augmentedneu- ral network framework for finite strain incompressible viscoelasticity
Kalina,K.A.,Brummund,J.,Kästner,M.,2026. Aphysics-augmentedneu- ral network framework for finite strain incompressible viscoelasticity. ComputerMethodsinAppliedMechanicsandEngineering455,118892. Li,W.,2016. Damagemodelsforsofttissues:asurvey. JournalofMedical and Biological Engineering 36, 285–307. Linden,L.,Klein,D.K.,Kalina,K.A.,Brummund,J.,Weeger,O.,Kä...
work page 2026
-
[5]
Proceedings of the Royal Society of London
A pseudo–elastic model for the mullins effect in filled rubber. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences 455, 2861–2877. Ostwald,R.,Kuhl,E.,Menzel,A.,2019.Ontheimplementationoffinitede- formation gradient-enhanced damage models. Computational Mechan- ics 64, 847–877. URL:https://doi.org/10.1007/...
-
[6]
Biomechanics and modeling in mechanobiology 16, 249–261
Modeling soft tissue damage and failure using a combined particle/continuum approach. Biomechanics and modeling in mechanobiology 16, 249–261. Ricker,A.,Kröger,N.H.,Wriggers,P.,2021. Comparisonofdiscontinuous damage models of mullins-type. Archive of Applied Mechanics 91, 4097–4119. Schröder,J.,Neff,P.,2003. Invariantformulationofhyperelastictransverse is...
work page 2021
-
[7]
Engineering Fracture Mechanics doi:10.1016/J.ENGFRACMECH.2018
An efficient fe-implementation of implicit gradient-enhanced damage models to simulate ductile fail- ure. Engineering Fracture Mechanics doi:10.1016/J.ENGFRACMECH.2018. 01.022. Simo, J.C., Ju, J.W.,
-
[8]
arXiv preprint arXiv:2412.07126
Fe-pinns: finite-element-based physics- informed neural networks for surrogate modeling. arXiv preprint arXiv:2412.07126 . Tac, V., Costabal, F.S., Tepole, A.B.,
-
[9]
Journal of the Mechanics and Physics of Solids 169, 105076
Nn-euclid: Deep-learning hyperelasticity without stress data. Journal of the Mechanics and Physics of Solids 169, 105076. Waffenschmidt,T.,Polindara,C.,Menzel,A.,Blanco,S.,2014. Agradient- enhanced large-deformation continuum damage model for fibre- reinforcedmaterials. ComputerMethodsinAppliedMechanicsandEn- gineering268,801–842. URL:https://www.scienced...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.