pith. sign in

arxiv: 2606.25275 · v1 · pith:EBBR7V3Onew · submitted 2026-06-24 · ⚛️ physics.comp-ph · cond-mat.mtrl-sci· physics.geo-ph

A convolutional neural network surrogate for hierarchical homogenization: fast elastic moduli prediction of digital rocks

Pith reviewed 2026-06-25 20:32 UTC · model grok-4.3

classification ⚛️ physics.comp-ph cond-mat.mtrl-sciphysics.geo-ph
keywords digital rock physicselastic moduliconvolutional neural networkhierarchical homogenizationsurrogate modelHashin-Shtrikman boundsmicro-CT imaging
0
0 comments X

The pith

A lightweight 3D CNN with hierarchical homogenization predicts effective elastic moduli of digital rocks from micro-CT images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a method that divides large rock images into subcubes, uses a 3D convolutional neural network to predict the elastic moduli of each subcube, and applies hierarchical homogenization to combine those predictions into full-sample effective moduli. Three training targets for the network are compared: the full anisotropic stiffness tensor, isotropic bulk and shear moduli, and Hashin-Shtrikman normalized factors. All three targets produce results that agree well with direct numerical simulations across multiple rock types while substantially lowering computational cost. Training from scratch on each rock type is fast enough to make transfer learning unnecessary.

Core claim

The authors establish that a shared convolutional backbone CNN trained on subcube elastic properties, when paired with hierarchical homogenization, yields effective moduli that match direct numerical simulation results for multiple rock types. The Hashin-Shtrikman normalized factor target provides the best overall speed-accuracy trade-off while guaranteeing physical consistency, and the isotropic (K, G) target serves as a slightly more accurate alternative.

What carries the argument

The hierarchical homogenization scheme that divides a large rock image into subcubes for independent CNN predictions of elastic moduli and then upscales the results to the full sample.

Load-bearing premise

The hierarchical homogenization step can accurately combine independent subcube predictions into the full-sample effective moduli without significant error from the division into subcubes or from the CNN approximation inside each subcube.

What would settle it

Perform the full CNN-plus-homogenization procedure on a rock sample whose size still permits direct numerical simulation and check whether the predicted effective moduli match the DNS values within the reported agreement range.

Figures

Figures reproduced from arXiv: 2606.25275 by Hanfeng Zhai, Rasool Ahmad, Tapan Mukerji, Wei Cai.

Figure 1
Figure 1. Figure 1: Representative 3D rendering of the B1 digital rock at 900 [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: CNN–HHM workflow: (a) partition the 3D scan into subcubes, (b) compute subcube effective elasticity [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: 3D CNN surrogate used to predict subcube elasticity from voxelized micro-CT inputs. Convolution [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: B1 subcube training evaluation and stiffness-level verification. (a) Parity plot comparing CNN [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: B1 subcube parity plots for bulk modulus [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Error propagation in the CNN–HHM pipeline for a 300 [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: CNN–HHM prediction on a 300 × 300 × 300 crop taken from the 900 × 900 × 900 B1 parent rock. Parity plots compare predicted effective bulk modulus K and shear modulus G against DNS references for Model 1, Model 2, Model 3, and Benchmark 1 [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: CNN–HHM prediction on a 600 × 600 × 600 crop taken from the same 900 × 900 × 900 B1 parent rock. Parity plots use the same Model 1–3 and Benchmark 1 labeling as in [PITH_FULL_IMAGE:figures/full_fig_p010_8.png] view at source ↗
read the original abstract

Digital rock physics (DRP) aims to estimate effective rock properties (e.g., elastic moduli) directly from 3D micro-CT images. However, direct numerical simulations (DNS) on high-resolution large 3D scans are often computationally prohibitive and severely limit the application of DRP. To address this bottleneck, we combine a lightweight 3D convolutional neural network (CNN) with hierarchical homogenization (HHM) and apply it to determine effective elastic moduli. In this scheme, a large rock image is divided into subcubes. The CNN replaces costly DNS by directly predicting subcube elastic moduli, while HHM upscales subcube-level predictions to the full rock. Using a shared convolutional backbone, we systematically compare three training targets: (i) full anisotropic $6\times6$ stiffness tensors, (ii) isotropic bulk and shear moduli $(K, G)$, and (iii) Hashin--Shtrikman (HS)-normalized factors. Across multiple rock types, all three models agree well with DNS results while substantially reducing the computational cost. Moreover, training from scratch on each rock type is fast enough that transfer learning is unnecessary. Across all three targets, the accuracy is comparable. In our comparative study, the HS-normalized factor offers the best overall speed--accuracy trade-off while guaranteeing physical consistency, making it a convenient default. The isotropic $(K, G)$ target is a slightly more accurate alternative.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 1 minor

Summary. The manuscript proposes combining a lightweight 3D CNN surrogate with hierarchical homogenization (HHM) to predict effective elastic moduli of digital rocks from large 3D micro-CT images. The image is partitioned into subcubes; the CNN replaces DNS by predicting local 6x6 stiffness tensors, isotropic (K,G), or HS-normalized factors; HHM then assembles the subcube predictions into full-sample moduli. Across rock types the authors claim that all three targets agree well with DNS while substantially lowering cost, with the HS target providing the best speed-accuracy trade-off and built-in physical consistency.

Significance. If the accuracy claims are quantitatively substantiated, the method would address a central computational bottleneck in digital rock physics by enabling rapid evaluation of large volumes without full DNS. The systematic comparison of three training targets and the explicit physical-consistency guarantee of the HS formulation are useful contributions. The absence of reported error metrics, validation-set sizes, and an isolated test of the HHM upscaling step currently prevents a full assessment of whether the headline claim holds.

major comments (3)
  1. [Abstract] Abstract: the statement that “all three models agree well with DNS results” supplies no quantitative error metrics (e.g., MAPE, R², or maximum relative error), no validation-set sizes, and no description of how the training data were generated or split. Without these numbers the strength of support for the central accuracy claim cannot be evaluated.
  2. [Methods / Results] The manuscript provides no separate error budget that isolates the contribution of the hierarchical homogenization operator from the CNN approximation error inside each subcube. A direct test—applying HHM to DNS-computed subcube tensors and comparing the result to full-sample DNS—would be required to confirm that division into subcubes does not introduce substantial artifacts.
  3. [Results] The weakest assumption—that HHM correctly assembles independent subcube predictions without significant inter-subcube interaction error—is not tested. The reported agreement with DNS therefore conflates CNN interpolation accuracy with the validity of the hierarchical upscaling step.
minor comments (1)
  1. [Abstract] Notation for the three targets should be introduced once with explicit symbols (e.g., C_{ij} for the 6×6 tensor, K/G for isotropic moduli, and the HS factor) and used consistently thereafter.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major comment below and will incorporate the suggested clarifications and additional analyses in the revised version.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the statement that “all three models agree well with DNS results” supplies no quantitative error metrics (e.g., MAPE, R², or maximum relative error), no validation-set sizes, and no description of how the training data were generated or split. Without these numbers the strength of support for the central accuracy claim cannot be evaluated.

    Authors: We agree that the abstract should be self-contained with respect to the central accuracy claim. Although the Results section reports MAPE, R², maximum relative errors, an 80/20 train/validation split, and DNS-based data generation details, we will revise the abstract to include representative quantitative metrics (e.g., MAPE values and R² for each target) and a concise statement on validation-set size and data preparation. revision: yes

  2. Referee: [Methods / Results] The manuscript provides no separate error budget that isolates the contribution of the hierarchical homogenization operator from the CNN approximation error inside each subcube. A direct test—applying HHM to DNS-computed subcube tensors and comparing the result to full-sample DNS—would be required to confirm that division into subcubes does not introduce substantial artifacts.

    Authors: The referee correctly notes the absence of an isolated error budget for the HHM step. In the revised manuscript we will add this analysis: we will apply the HHM operator to the DNS-computed subcube stiffness tensors and compare the upscaled moduli against the corresponding full-sample DNS results. This will quantify the HHM contribution separately from CNN prediction error. revision: yes

  3. Referee: [Results] The weakest assumption—that HHM correctly assembles independent subcube predictions without significant inter-subcube interaction error—is not tested. The reported agreement with DNS therefore conflates CNN interpolation accuracy with the validity of the hierarchical upscaling step.

    Authors: We agree that the current presentation does not separate the two error sources and that the overall DNS agreement therefore conflates them. The new HHM-only test described in response to the previous comment will directly address this point by isolating the inter-subcube interaction error. We will include the results and a discussion of their implications in the revised Results section. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper describes a standard supervised ML surrogate: a CNN is trained on DNS-computed subcube elastic properties (one of three targets) and then combined via HHM to predict full-sample moduli, with accuracy measured against independent full-sample DNS. This setup does not reduce any claimed result to its inputs by construction, nor does it invoke self-citations, uniqueness theorems, or ansatzes that are load-bearing. The validation on held-out full samples is external to the training data, and no equations or steps in the abstract exhibit self-definitional or fitted-input-called-prediction patterns. The method is explicitly data-driven rather than a first-principles derivation.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The central claim rests on the CNN being able to learn a sufficiently accurate mapping from subcube images to elastic properties and on hierarchical homogenization being valid for the chosen subcube size. Both are fitted or assumed rather than derived from first principles within the paper.

free parameters (1)
  • CNN network weights
    The convolutional network parameters are optimized during training to match DNS-computed moduli on the training subcubes.
axioms (2)
  • domain assumption Hierarchical homogenization accurately recovers the effective moduli of the full sample from independent subcube predictions.
    Invoked when the paper states that HHM upscales the CNN predictions to the full rock.
  • domain assumption The subcube size is small enough that boundary effects between subcubes do not dominate the upscaling error.
    Implicit in the choice to divide the large image into subcubes for separate CNN evaluation.

pith-pipeline@v0.9.1-grok · 5803 in / 1409 out tokens · 33377 ms · 2026-06-25T20:32:40.174150+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

24 extracted references

  1. [1]

    Saenger, Ratnan- abha Sain, Nishank Saxena, Sarah Ricker, Andreas Wiegmann, and Xin Zhan

    Heiko Andrä, Nicolas Combaret, Jack Dvorkin, Erik Glatt, Junehee Han, Matthias Kabel, Youngseuk Keehm, Fabian Krzikalla, Minhui Lee, Claudio Madonna, Mike Marsh, Tapan Mukerji, Erik H. Saenger, Ratnan- abha Sain, Nishank Saxena, Sarah Ricker, Andreas Wiegmann, and Xin Zhan. Digital rock physics bench- marks—part i: Imaging and segmentation.Computers & Geo...

  2. [2]

    Saenger, Ratnan- abha Sain, Nishank Saxena, Sarah Ricker, Andreas Wiegmann, and Xin Zhan

    Heiko Andrä, Nicolas Combaret, Jack Dvorkin, Erik Glatt, Junehee Han, Matthias Kabel, Youngseuk Keehm, Fabian Krzikalla, Minhui Lee, Claudio Madonna, Mike Marsh, Tapan Mukerji, Erik H. Saenger, Ratnan- abha Sain, Nishank Saxena, Sarah Ricker, Andreas Wiegmann, and Xin Zhan. Digital rock physics bench- marks—part ii: Computing effective properties.Computer...

  3. [3]

    Schepp, Benedikt Ahrens, Martin Balcewicz, Mandy Duda, Mathias Nehler, Maria Osorno, David Uribe, Holger Steeb, Benoit Nigon, Ferdinand Stöckhert, Donald A

    Laura L. Schepp, Benedikt Ahrens, Martin Balcewicz, Mandy Duda, Mathias Nehler, Maria Osorno, David Uribe, Holger Steeb, Benoit Nigon, Ferdinand Stöckhert, Donald A. Swanson, Mirko Siegert, Marcel G. Gurris, and Erik H. Saenger. Digital rock physics and laboratory considerations on a high-porosity volcanic rock.Scientific Reports, 10:5840, 2020

  4. [4]

    Validation of digital rock physics algorithms.Minerals, 9(11):669, 2019

    Rongrong Lin and Leon Thomsen. Validation of digital rock physics algorithms.Minerals, 9(11):669, 2019

  5. [5]

    Alpak, Jesse Dietderich, Sander Hunter, and Ruarri J

    Nishank Saxena, Ronny Hofmann, Faruk O. Alpak, Jesse Dietderich, Sander Hunter, and Ruarri J. Day- Stirrat. Effect of image segmentation & voxel size on micro-ct computed effective transport & elastic prop- erties.Marine and Petroleum Geology, 86:972–990, September 2017

  6. [6]

    Computation of effective elastic moduli of rocks using hierarchical homogenization.Journal of the Mechanics and Physics of Solids, 2023

    Rasool Ahmad, Mingliang Liu, Michael Ortiz, Tapan Mukerji, and Wei Cai. Computation of effective elastic moduli of rocks using hierarchical homogenization.Journal of the Mechanics and Physics of Solids, 2023

  7. [7]

    Homogenizing elastic properties of large digital rock images by combining cnn with hierarchical homogenization method.arXiv preprint arXiv:2305.06519, 2023

    Rasool Ahmad, Mingliang Liu, Michael Ortiz, Tapan Mukerji, and Wei Cai. Homogenizing elastic properties of large digital rock images by combining cnn with hierarchical homogenization method.arXiv preprint arXiv:2305.06519, 2023

  8. [8]

    Machine learning assisted prediction of porosity and related properties using digital rock images.ACS Omega, 9(28):30205–30223, June 2024

    Md Irfan Khan and Aaditya Khanal. Machine learning assisted prediction of porosity and related properties using digital rock images.ACS Omega, 9(28):30205–30223, June 2024

  9. [9]

    Predicting permeability from 3d rock images based on cnn with physical information.Journal of Hydrology, 606:127473, March 2022

    Pengfei Tang, Dongxiao Zhang, and Heng Li. Predicting permeability from 3d rock images based on cnn with physical information.Journal of Hydrology, 606:127473, March 2022

  10. [10]

    Generalizable permeability prediction of digital porous media via a novel multi-scale 3d convolutional neural network.Water Resources Research, 58(3), March 2022

    Mohamed Elmorsy, Wael El-Dakhakhni, and Benzhong Zhao. Generalizable permeability prediction of digital porous media via a novel multi-scale 3d convolutional neural network.Water Resources Research, 58(3), March 2022

  11. [11]

    Deep 3d convolutional neural network applied to ct segmented image for rock properties prediction

    Leandro Passos de Figueiredo, Fernando Bordignon, Rodrigo Exterkoetter, Bruno Barbosa Rodrigues, and Maury Duarte. Deep 3d convolutional neural network applied to ct segmented image for rock properties prediction. InSEG Technical Program Expanded Abstracts 2019, page xxx–xxx. Society of Exploration Geo- physicists, August 2019

  12. [12]

    Image-based velocity estimation of rock using convolutional neural networks.Neural Networks, 111:89–97, March 2019

    Sadegh Karimpouli and Pejman Tahmasebi. Image-based velocity estimation of rock using convolutional neural networks.Neural Networks, 111:89–97, March 2019

  13. [13]

    Digital rock reconstruction with user-defined properties using condi- tional generative adversarial networks.Transport in Porous Media, 144(1):255–281, January 2022

    Qiang Zheng and Dongxiao Zhang. Digital rock reconstruction with user-defined properties using condi- tional generative adversarial networks.Transport in Porous Media, 144(1):255–281, January 2022

  14. [14]

    Reconstructing high fidelity digital rock images using deep convolutional neural networks.Scientific Reports, 12(1), March 2022

    Majid Bizhani, Omid Haeri Ardakani, and Edward Little. Reconstructing high fidelity digital rock images using deep convolutional neural networks.Scientific Reports, 12(1), March 2022

  15. [15]

    Zhiyu Hou and Danping Cao. Estimating elastic parameters from digital rock images based on multi-task learning with multi-gate mixture-of-experts.Journal of Petroleum Science and Engineering, 213:110310, June 2022

  16. [16]

    Transferable 3d convolutional neural networks for elastic constants prediction in nanoporous metals.Materials & Design, 260:114896, December 2025

    Sergei Zorkaltsev, Rafał Topolnicki, Tal-El Carmon, Santhosh Mathesan, Paweł Dłotko, Dan Mordehai, and Maciej Haranczyk. Transferable 3d convolutional neural networks for elastic constants prediction in nanoporous metals.Materials & Design, 260:114896, December 2025

  17. [17]

    Hashin and S

    Z. Hashin and S. Shtrikman. A variational approach to the theory of the elastic behaviour of multiphase materials.Journal of the Mechanics and Physics of Solids, 11(2):127–140, March 1963

  18. [18]

    Geodict: Digital material research software

    Math2Market GmbH. Geodict: Digital material research software. Software, 2023. 17 Elastic moduli predictions of digital rocks Zhai et al

  19. [19]

    Moulinec and P

    H. Moulinec and P . Suquet. A numerical method for computing the overall response of nonlinear composites with complex microstructure.Computer Methods in Applied Mechanics and Engineering, 157(1–2):69–94, April 1998

  20. [20]

    Goldfarb, Ken Ikeda, Richard A

    Eric J. Goldfarb, Ken Ikeda, Richard A. Ketcham, Kyle T. Spikes, Maša Prodanovi´ c, and Nicola Tisato. Pre- dictive digital rock physics without segmentation.Computers & Geosciences, 159:105008, 2022

  21. [21]

    Teubner Verlag, Leipzig, 1928

    Woldemar Voigt.Lehrbuch der Kristallphysik. Teubner Verlag, Leipzig, 1928

  22. [22]

    A. Reuss. Berechnung der fließgrenze von mischkristallen auf grund der plastizitätsbedingung für einkristalle.Zeitschrift für Angewandte Mathematik und Mechanik, 9(1):49–58, 1929

  23. [23]

    R. Hill. The elastic behaviour of a crystalline aggregate.Proceedings of the Physical Society. Section A, 65(5):349– 354, 1952

  24. [24]

    On some variational principles in anisotropic and nonhomogeneous elasticity.Journal of the Mechanics and Physics of Solids, 10(4):335–342, 1962

    Z am Hashin and SJJotM Shtrikman. On some variational principles in anisotropic and nonhomogeneous elasticity.Journal of the Mechanics and Physics of Solids, 10(4):335–342, 1962. 18