Learning parameter-dependent shear viscosity from data, with application to sea and land ice
Pith reviewed 2026-05-17 22:23 UTC · model grok-4.3
The pith
A neural network can learn temperature- and concentration-dependent ice viscosity from velocity or stress data while preserving isotropy and a convex dissipation potential.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors show that a neural network, when written in terms of tensor invariants and external parameters and equipped with architectural constraints that enforce isotropy and convexity of the dissipation potential, can accurately represent the shear viscosity of ice. The same network is trained either by direct regression against stress data or by PDE-constrained optimization against velocity observations. For land ice the network recovers the temperature-dependent Glen's law; for sea ice it recovers a concentration-dependent viscous-plastic shear term; and from floe data it produces a new model that exhibits both shear-thickening and shear-thinning and that continues to predict well at un
What carries the argument
Neural network that maps strain-rate tensor invariants and scalar parameters to effective viscosity, constructed so that the resulting rheology is isotropic and derives from a convex dissipation potential.
If this is right
- Glen's law temperature dependence for land ice can be recovered from velocity data alone.
- The shear component of the viscous-plastic sea-ice model can be learned as a function of concentration without assuming a specific algebraic form.
- The discovered rheology generalizes to parameter values and flow conditions not present in the training data.
- Both shear-thickening and shear-thinning regimes appear automatically as concentration changes.
Where Pith is reading between the lines
- The same invariant-based network could be applied to other non-Newtonian materials whose rheology depends on additional scalars such as temperature or particle volume fraction.
- Embedding the learned viscosity directly into large-scale ice-sheet or sea-ice codes would allow local adaptation of the flow law without manual retuning of parameters.
- The method supplies a route to quantify how uncertainty in observations propagates into uncertainty in the inferred viscosity function.
Load-bearing premise
The neural network whose architecture is built around tensor invariants can accurately represent the true underlying rheology from noisy or limited velocity or stress data while preserving convexity of the dissipation potential.
What would settle it
If the velocities computed with the learned viscosity disagree systematically with independent measurements of ice motion at temperatures or concentrations lying outside the training range, the inferred rheology does not capture the governing physics.
Figures
read the original abstract
Complex physical systems which exhibit fluid-like behavior are often modeled as non-Newtonian fluids. A crucial element of a non-Newtonian model is the rheology, which relates inner stresses with strain-rates. We propose a framework for inferring rheological models from data that represents the fluid's effective viscosity with a neural network. By writing the rheological law in terms of tensor invariants and tailoring the network's properties, the inferred model satisfies key physical and mathematical properties, such as isotropic frame-indifference and existence of a convex potential of dissipation. Within this framework, we propose two approaches to learning a fluid's rheology: 1) a standard regression that fits the rheological model to stress data and 2) a PDE-constrained optimization method that infers rheological models from velocity data. For the latter approach, we combine finite element and machine learning libraries. We demonstrate the accuracy and robustness of our method on land and sea ice rheologies which also depend on external parameters. For land ice, we infer the temperature-dependent Glen's law and, for sea ice, the concentration-dependent shear component of the viscous-plastic model. For these two models, we explore the effects of large data errors. Finally, we infer an unknown concentration-dependent model that reproduces Lagrangian ice floe simulation data. Our method discovers a rheology that generalizes well outside of the training dataset and exhibits both shear-thickening and thinning behaviors depending on the concentrations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a neural-network framework for inferring parameter-dependent shear viscosity in non-Newtonian fluid models, with applications to land and sea ice. By expressing the rheological law in terms of tensor invariants and tailoring the network architecture, the method enforces isotropic frame-indifference and the existence of a convex dissipation potential. Two inference strategies are presented: direct regression onto stress data and PDE-constrained optimization that fits velocity observations. The approach recovers the temperature-dependent Glen law for land ice and the concentration-dependent viscous-plastic shear term for sea ice, explores robustness to large data errors, and infers an unknown concentration-dependent rheology from Lagrangian floe simulations that is reported to generalize outside the training set and to exhibit both shear-thickening and shear-thinning regimes.
Significance. If the convexity and generalization properties are rigorously verified, the work would be significant for data-driven rheology modeling in glaciology and computational mechanics. It enables inference of complex, parameter-dependent constitutive laws from indirect velocity measurements while preserving key mathematical structure, and the hybrid finite-element/machine-learning implementation for PDE-constrained learning constitutes a practical technical advance.
major comments (2)
- [PDE-constrained optimization] PDE-constrained optimization section: the loss is defined on velocity mismatch rather than on the dissipation potential itself. Although the network architecture is designed to guarantee convexity for each fixed parameter value, no post-hoc diagnostic (e.g., eigenvalue analysis of the Hessian of the learned potential evaluated at held-out concentrations) is reported. This verification is load-bearing for the claim that the inferred thickening/thinning behavior is physically meaningful rather than an optimization artifact under noisy velocity data.
- [Numerical experiments] Results on the unknown concentration-dependent model (floe-simulation experiment): the generalization statement outside the training concentrations is central to the strongest claim, yet the manuscript provides only qualitative descriptions of the recovered behavior without quantitative error tables or direct comparison against established concentration-dependent baselines on the held-out data.
minor comments (2)
- [Abstract] The abstract states that the method 'explores the effects of large data errors' but does not indicate the precise noise levels, error models, or which figures/tables quantify those effects.
- [Methods] Notation for the neural-network input (tensor invariants) and output (viscosity) should be introduced with explicit dimension and symmetry properties in the methods section to improve readability for readers outside the immediate subfield.
Simulated Author's Rebuttal
We thank the referee for their careful reading and constructive feedback on our manuscript. We address each major comment below and outline the revisions we will make to strengthen the presentation and support for our claims.
read point-by-point responses
-
Referee: PDE-constrained optimization section: the loss is defined on velocity mismatch rather than on the dissipation potential itself. Although the network architecture is designed to guarantee convexity for each fixed parameter value, no post-hoc diagnostic (e.g., eigenvalue analysis of the Hessian of the learned potential evaluated at held-out concentrations) is reported. This verification is load-bearing for the claim that the inferred thickening/thinning behavior is physically meaningful rather than an optimization artifact under noisy velocity data.
Authors: We agree that an explicit post-training verification of convexity would provide stronger evidence that the observed shear-thickening and shear-thinning regimes are physically meaningful. Although the architecture enforces convexity of the dissipation potential by construction for each fixed parameter (via the choice of convex activations and invariant-based input), we will add a post-hoc diagnostic in the revised manuscript. Specifically, we will report the minimum eigenvalues of the Hessian of the learned potential evaluated at several held-out concentration values to confirm positive semi-definiteness and rule out optimization artifacts. revision: yes
-
Referee: Results on the unknown concentration-dependent model (floe-simulation experiment): the generalization statement outside the training concentrations is central to the strongest claim, yet the manuscript provides only qualitative descriptions of the recovered behavior without quantitative error tables or direct comparison against established concentration-dependent baselines on the held-out data.
Authors: We acknowledge that quantitative support would better substantiate the generalization claim. Because the target rheology is unknown a priori, established baselines do not exist; however, we can still quantify performance via velocity and stress prediction errors on held-out concentrations. In the revision we will add tables reporting these errors for the inferred model on both training and held-out data, together with comparisons against simple parametric baselines (e.g., constant-viscosity and linear-in-concentration models) to demonstrate the advantage of the learned nonlinear dependence. revision: yes
Circularity Check
No circularity: data-driven inference with architecture-enforced properties
full rationale
The paper's core contribution is a neural-network representation of parameter-dependent viscosity, constructed explicitly in terms of tensor invariants so that frame-indifference and convexity of the dissipation potential hold by architectural design rather than by any derived claim. Both the direct regression and PDE-constrained routes fit the network weights to external stress or velocity observations; the reported generalization, shear-thickening/thinning behavior, and recovery of Glen's law are therefore empirical outcomes on held-out data or simulation snapshots, not algebraic identities or self-citations. No load-bearing step reduces a prediction to a fitted input, renames a known result, or imports uniqueness from prior author work; the derivation chain remains self-contained against the supplied data.
Axiom & Free-Parameter Ledger
free parameters (1)
- neural network weights and biases
axioms (2)
- domain assumption The rheological law can be written in terms of tensor invariants to guarantee isotropic frame-indifference.
- domain assumption A convex potential of dissipation exists for the inferred viscosity.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
we assume the existence of a continuously differentiable and strictly convex potential of dissipation j... ∂j/∂(trDu)(ιDu)=ψ1... ∂j/∂(|Su|)(ιDu)=ψ2|Su|... j is strictly convex if and only if the function s↦ψ(s)s is strictly monotonically increasing
-
IndisputableMonolith/Foundation/AlphaCoordinateFixation.leanJ_uniquely_calibrated_via_higher_derivative refines?
refinesRelation between the paper passage and the cited Recognition theorem.
The function Π... penalizes points where the derivative of ... ψ(|˙γ|,λ)˙γ is negative... weakly enforce... monotonically increasing in ˙γ
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
A. Akerson, A. Rajan, and K. Bhattacharya. Learning constitutive relations from ex- periments: 1. PDE-constrained optimization.Journal of the Mechanics and Physics of Solids, page 106128, 2025
work page 2025
-
[2]
M. G. Alhashim, K. Hausknecht, and M. P. Brenner. Control of flow behavior in complex fluids using automatic differentiation.Proceedings of the National Academy of Sciences, 122(8):e2403644122, 2025
work page 2025
-
[3]
B. Amos, L. Xu, and J. Z. Kolter. Input convex neural networks. InInternational conference on machine learning, pages 146–155. PMLR, 2017
work page 2017
-
[4]
E. Blockley, M. Vancoppenolle, E. Hunke, C. Bitz, D. Feltham, J.-F. Lemieux, M. Losch, E. Maisonnave, D. Notz, P. Rampal, et al. The future of sea ice modeling: Where do we go from here?Bulletin of the American Meteorological Society, 101(8):E1304–E1311, 2020
work page 2020
-
[5]
N. Bouziani, D. A. Ham, and A. Farsi. Differentiable programming across the PDE and machine learning barrier.arXiv preprint arXiv:2409.06085, 2024
-
[6]
G. Bruer and T. Isaac. Inferring ice sheet damage models from limited observations using CRIKit: the Constitutive Relation Inference Toolkit.arXiv preprint arXiv:2204.09748, 2022. 22
-
[7]
M. Coon, G. Maykut, R. Pritchard, D. Rorthrock, and A. Thorndike. Modeling the pack-ice as an elastic-plastic material.AIDJEX Bulletin, 24:1–105, 1974
work page 1974
- [8]
-
[9]
T. L. Edwards, S. Nowicki, B. Marzeion, R. Hock, H. Goelzer, H. Seroussi, N. C. Jourdain, D. A. Slater, F. E. Turner, C. J. Smith, et al. Projected land ice contributions to twenty- first-century sea level rise.Nature, 593(7857):74–82, 2021
work page 2021
-
[10]
L. C. Evans.Partial differential equations, volume 19. American Mathematical Society, 2022
work page 2022
-
[11]
D. L. Feltham. Sea ice rheology.Annu. Rev. Fluid Mech., 40:91–112, 2008
work page 2008
-
[12]
J. Glen. The flow law of ice: a discussion of the assumptions made in glacier theory, their experimental foundation and consequences. InPhysics of the Movement of Ice: Symposium at Chamonix 1958, pages 171–183. Int. Assoc. Hydrol. Sci., Wallingford, UK, 1958
work page 1958
-
[13]
R. Greve and H. Blatter.Dynamics of ice sheets and glaciers. Springer Science & Business Media, 2009
work page 2009
-
[14]
É. Guazzelli and O. Pouliquen. Rheology of dense granular suspensions.Journal of Fluid Mechanics, 852:P1, 2018
work page 2018
-
[15]
M. E. Gurtin.An introduction to continuum mechanics. Academic press, 1982
work page 1982
-
[16]
D. A. Ham, P. H. J. Kelly, L. Mitchell, C. J. Cotter, R. C. Kirby, K. Sagiyama, N. Bouziani, S. Vorderwuelbecke, T. J. Gregory, J. Betteridge, D. R. Shapero, R. W. Nixon-Hill, C. J. Ward, P. E. Farrell, P. D. Brubeck, I. Marsden, T. H. Gibson, M. Ho- molya, T. Sun, A. T. T. McRae, F. Luporini, A. Gregory, M. Lange, S. W. Funke, F. Rathgeber, G.-T. Bercea,...
work page 2023
-
[17]
A. Herman. Granular effects in sea ice rheology in the marginal ice zone.Philosophical Transactions of the Royal Society A, 380(2235):20210260, 2022
work page 2022
-
[18]
W. Hibler III. A dynamic thermodynamic sea ice model.Journal of physical oceanography, 9(4):815–846, 1979
work page 1979
-
[19]
J.Hu, J.Rudi, M.Gurnis, andG.Stadler. ConstrainingEarth’snonlinearmantleviscosity using plate-boundary resolving global inversions.Proceedings of the National Academy of Sciences, 121(28):e2318706121, 2024
work page 2024
-
[20]
J. T. Jenkins and S. B. Savage. A theory for the rapid flow of identical, smooth, nearly elastic, spherical particles.Journal of Fluid Mechanics, 130:187–202, 1983
work page 1983
-
[21]
P. Jop, Y. Forterre, and O. Pouliquen. A constitutive law for dense granular flows.Nature, 441(7094):727–730, 2006. 23
work page 2006
-
[22]
K. Kamrin and G. Koval. Nonlocal constitutive relation for steady granular flow.Physical review letters, 108(17):178301, 2012
work page 2012
-
[23]
A. Kontogiannis, R. Hodgkinson, S. Reynolds, and E. L. Manchester. Learning rheological parameters of non-Newtonian fluids from velocimetry data.Journal of Fluid Mechanics, 1011:R3, 2025
work page 2025
- [24]
- [25]
-
[26]
R. G. Larson.Constitutive equations for polymer melts and solutions. Butterworth- Heinemann, 2013
work page 2013
-
[27]
H. M. Laun. Rheological properties of aqueous polymer dispersions.Die Angewandte Makromolekulare Chemie: Applied Macromolecular Chemistry and Physics, 123(1):335– 359, 1984
work page 1984
-
[28]
K. R. Lennon, G. H. McKinley, and J. W. Swan. Scientific machine learning for mod- eling and simulating complex fluids.Proceedings of the National Academy of Sciences, 120(27):e2304669120, 2023
work page 2023
-
[29]
J. Lohmann and P. D. Ditlevsen. Risk of tipping the overturning circulation due to increasing rates of ice melt.Proceedings of the National Academy of Sciences, 118(9):e2017989118, 2021
work page 2021
-
[30]
M. Mahmoudabadbozchelou, K. M. Kamani, S. A. Rogers, and S. Jamali. Un- biased construction of constitutive relations for soft materials from experiments via rheology-informed neural networks.Proceedings of the National Academy of Sciences, 121(2):e2313658121, 2024
work page 2024
-
[31]
G. E. Manucharyan and B. P. Montemuro. SubZero: A sea ice model with an explicit representation of the floe life cycle.Journal of Advances in Modeling Earth Systems, 14(12):e2022MS003247, 2022
work page 2022
-
[32]
A. Morro and C. Giorgi.Mathematical modelling of continuum physics, volume 314. Springer, 2023
work page 2023
-
[33]
K. D. P. and J. Ba. Adam: A method for stochastic optimization.arXiv preprint arXiv:1412.6980, 1412(6), 2014
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[34]
N. Parolini, A. Poiatti, J. Vené, and M. Verani. Structure-preserving neural networks in data-driven rheological models.SIAM Journal on Scientific Computing, 47(1):C182– C206, 2025. 24
work page 2025
- [35]
-
[36]
F. Pattyn, L. Perichon, A. Aschwanden, B. Breuer, B. de Smedt, O. Gagliardini, G. H. Gudmundsson, R. C. A. Hindmarsh, A. Hubbard, J. V. Johnson, T. Kleiner, Y. Kono- valov, C. Martin, A. J. Payne, D. Pollard, S. Price, M. Rückamp, F. Saito, O. Souček, S. Sugiyama, and T. Zwinger. Benchmark experiments for higher-order and full-stokes ice sheet models (ISM...
work page 2008
-
[37]
K. R. Rajagopal. On implicit constitutive theories for fluids.Journal of Fluid Mechanics, 550:243–249, 2006
work page 2006
-
[38]
G. Ranalli.Rheology of the Earth. Springer Science & Business Media, 1995
work page 1995
- [39]
- [40]
- [41]
- [42]
-
[43]
S. Toppaladoddi. A viscous continuum theory of sea ice motion based on stochastic floe dynamics.Journal of Fluid Mechanics, 1014:A6, 2025
work page 2025
-
[44]
C. Truesdell and K. R. Rajagopal.An introduction to the mechanics of fluids. Springer Science & Business Media, 2000
work page 2000
-
[45]
Y. Wang, C.-Y. Lai, D. J. Prior, and C. Cowen-Breen. Deep learning the flow law of Antarctic ice shelves.Science, 387(6739):1219–1224, 2025
work page 2025
-
[46]
A. V. Wilchinsky and D. L. Feltham. Modelling the rheology of sea ice as a collection of diamond-shaped floes.Journal of non-Newtonian fluid mechanics, 138(1):22–32, 2006
work page 2006
-
[47]
S. Wright and J. Nocedal. Numerical optimization.Springer Science, 35(67-68):7, 1999. 25
work page 1999
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.