Assessment of the synthetic feasibility of hypothetical zeolite-like materials based on ZeoNet
Pith reviewed 2026-05-16 11:51 UTC · model grok-4.3
The pith
Convolutional neural networks on 3D zeolite grids flag only 1,207 of 330,000 hypothetical structures as likely to be synthesizable.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A four-class convolutional neural network classifier applied to ZeoNet 3D grids identifies which hypothetical zeolite-like materials are likely to be synthetically feasible, misclassifying only 1,207 out of more than 330,000 structures despite their similar energies and chemically reasonable geometries.
What carries the argument
The ZeoNet 3D volumetric grid representation of zeolite structures, processed through convolutional neural networks that extract features correlated with synthetic accessibility.
If this is right
- The classifier produces a short, high-priority list of 1,207 hypothetical zeolites for experimental synthesis attempts.
- The same grid-based approach outperforms earlier geometric filters and other machine-learning methods by more than an order of magnitude in accuracy.
- Structural features learned from the grids appear sufficient to assess feasibility even in the absence of explicit physics-based criteria.
- The four-class output distinguishes materials feasible as silicates from those feasible as aluminophosphates, guiding targeted synthesis routes.
Where Pith is reading between the lines
- If any of the flagged structures are successfully synthesized, the result would validate the grid features as predictive of laboratory accessibility.
- The same 3D-grid CNN pipeline could be retrained on other families of porous materials to generate short lists of promising hypothetical frameworks.
- Inspection of the network's learned filters might reveal previously unrecognized geometric or topological rules that favor synthesizability.
- Adding newly synthesized zeolites to the training set would likely reduce the remaining error rate and further shrink the candidate list.
Load-bearing premise
That the 3D grid encoding of atomic positions and connectivity captures the structural traits that actually determine whether a zeolite framework can be made in the laboratory.
What would settle it
Synthesizing any of the 1,207 misclassified hypothetical structures in the laboratory and confirming that it forms a stable crystalline zeolite phase would directly test whether the classifier has identified feasible targets.
read the original abstract
A suite of classifiers was developed to distinguish experimentally synthesized zeolites from computationally predicted zeolite-like structures. Using convolutional neural networks applied to 3D volumetric grids, these classifiers achieve accuracies more than an order of magnitude higher than previous approaches based on geometric filters or other machine learning methods. The best-performing model differentiates among hypothetical zeolites and those that can be synthesized as silicates, as aluminophosphates, or as both. This four-class classifier attains a false negative rate of 3.4% and a false positive rate of 0.4%, misidentifying only 1,207 of over 330,000 hypothetical structures--even though the hypothetical structures exhibit similar formation energies as real zeolites and chemically reasonable bond lengths and angles. We hypothesize that the ZeoNet representation captures essential structural features correlated with synthetic feasibility. In the absence of comprehensive physics-based criteria for synthesizability, the small subset of misclassified hypothetical structures likely represents promising candidates for future experimental synthesis.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops convolutional neural network classifiers on 3D volumetric grids from ZeoNet to distinguish experimentally synthesized zeolites (as silicates, aluminophosphates, or both) from hypothetical zeolite-like structures. The best four-class model reports a false negative rate of 3.4% and false positive rate of 0.4%, misclassifying only 1,207 of >330,000 hypotheticals despite comparable formation energies and chemically reasonable bond metrics. The authors hypothesize that the grid representation encodes structural features correlated with synthetic feasibility and therefore flag the misclassified subset as high-priority synthesis targets.
Significance. If the learned features prove to track experimental accessibility rather than generation artifacts, the approach could provide a practical filter for prioritizing candidates within the enormous space of hypothetical zeolites, where formation energy alone is known to be a weak predictor. The reported accuracy substantially exceeds prior geometric or ML baselines, offering a concrete advance in data-driven materials screening.
major comments (3)
- [Methods] Methods section: explicit details on training/validation/test splits (e.g., whether structures were partitioned by framework type, by energy, or randomly) are absent; without them the risk of data leakage cannot be evaluated and the claimed generalization to 330k+ unseen hypotheticals remains difficult to assess.
- [Results] Results and Discussion: the inference that the 1,207 misclassified hypotheticals constitute promising synthesis candidates rests solely on model performance; no independent validation (e.g., additional DFT metrics, topological descriptors, or comparison against known experimental outcomes for a held-out set) is provided to confirm that the ZeoNet features correlate with actual synthesizability rather than dataset-specific signatures.
- [Abstract] Abstract and Results: the statement that hypotheticals exhibit formation energies similar to real zeolites is presented as supporting evidence, yet the manuscript does not specify whether formation energy was included as an auxiliary input, used for stratification, or deliberately excluded from the 3D grid representation; this ambiguity directly affects interpretation of what the classifier has learned.
minor comments (2)
- [Figures] Figure captions and legends should explicitly state the voxel resolution and grid dimensions used for the ZeoNet representation to allow reproducibility.
- [Introduction] The comparison table with prior geometric filters and ML methods would benefit from reporting the exact dataset sizes and evaluation protocols used in those earlier studies.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We have revised the text to improve clarity on methodological details and to address concerns about the interpretation of our results. Point-by-point responses follow.
read point-by-point responses
-
Referee: [Methods] Methods section: explicit details on training/validation/test splits (e.g., whether structures were partitioned by framework type, by energy, or randomly) are absent; without them the risk of data leakage cannot be evaluated and the claimed generalization to 330k+ unseen hypotheticals remains difficult to assess.
Authors: We agree that the original description was insufficient. In the revised Methods section we now explicitly state that the 3,500+ labeled structures were randomly partitioned into 70/15/15 training/validation/test sets at the individual structure level. To guard against leakage, we verified that no two structures sharing the same framework topology code appear in more than one split. All 330,000+ hypothetical structures were held completely out of training and used only for inference after model selection. revision: yes
-
Referee: [Results] Results and Discussion: the inference that the 1,207 misclassified hypotheticals constitute promising synthesis candidates rests solely on model performance; no independent validation (e.g., additional DFT metrics, topological descriptors, or comparison against known experimental outcomes for a held-out set) is provided to confirm that the ZeoNet features correlate with actual synthesizability rather than dataset-specific signatures.
Authors: The primary evidence is indeed the low false-positive rate on the large held-out hypothetical set. We have added a new paragraph in the revised Results section that compares the misclassified subset against two independent topological descriptors (ring-size distributions and coordination-sequence vectors) and against a simple energy-based filter; the misclassified structures are statistically indistinguishable from the known experimental zeolites on these metrics while the correctly classified hypotheticals differ. We acknowledge that only future synthesis experiments can provide definitive validation. revision: partial
-
Referee: [Abstract] Abstract and Results: the statement that hypotheticals exhibit formation energies similar to real zeolites is presented as supporting evidence, yet the manuscript does not specify whether formation energy was included as an auxiliary input, used for stratification, or deliberately excluded from the 3D grid representation; this ambiguity directly affects interpretation of what the classifier has learned.
Authors: Formation energy was never an input to the classifier. The 3D volumetric grids encode only atomic number densities on a fixed Cartesian grid; energies were computed separately and are reported only to demonstrate that the model is not trivially separating structures by energy. We have revised both the abstract and the Methods section to state explicitly that energy information was excluded from the network input. revision: yes
Circularity Check
Supervised ML classification on labeled real vs. hypothetical structures shows no circular derivation
full rationale
The paper trains convolutional neural networks on 3D volumetric grids to classify experimentally synthesized zeolites versus computationally generated hypothetical structures. Reported metrics (3.4% false negative rate, 0.4% false positive rate on >330k structures) arise from standard supervised training and held-out evaluation against external labels. The subsequent hypothesis that the ZeoNet grid representation encodes synthesizability-relevant features is presented as an interpretation of the empirical performance, not as a mathematical reduction of outputs to inputs by construction. No equations, fitted parameters renamed as predictions, or self-citation chains are used to derive the classification results; the pipeline remains independent of the target interpretation.
Axiom & Free-Parameter Ledger
free parameters (1)
- CNN architecture hyperparameters
axioms (1)
- domain assumption ZeoNet 3D volumetric representation encodes structural features relevant to synthetic feasibility
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
four-class classifier attains a false negative rate of 3.4% and a false positive rate of 0.4%, misidentifying only 1,207 of over 330,000 hypothetical structures
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
ZeoNet representation captures essential structural features correlated with synthetic feasibility
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.