Discovery and recovery of crystalline materials with property-conditioned transformers
Generative models have recently shown great promise for accelerating the design and discovery of new functional materials. Conditional generation enhances this capacity by allowing inverse design, where specific desired properties can be requested during the generation process. However, conditioning of transformer-based approaches, in particular, is constrained by discrete tokenisation schemes and the risk of catastrophic forgetting during fine-tuning. This work introduces CrystaLLM-π (property injection), a conditional autoregressive framework that integrates continuous property representations directly into the transformer's attention mechanism. Two architectures, Property-Key-Value (PKV) Prefix attention and PKV Residual attention, are presented. These methods bypass inefficient sequence-level tokenisation and preserve foundational knowledge from unsupervised pre-training on Crystallographic Information Files (CIFs) as textual input. We establish the efficacy of these mechanisms through systematic robustness studies and evaluate the framework's versatility across two distinct tasks. First, for structure recovery, the model processes high-dimensional, heterogeneous X-ray diffraction patterns, achieving structural accuracy competitive with specialised models and demonstrating applications to experimental structure recovery and polymorph differentiation. Second, for materials discovery, the model is fine-tuned on a specialised photovoltaic dataset to generate novel, stable candidates validated by Density Functional Theory (DFT). It implicitly learns to target optimal band gap regions for high photovoltaic efficiency, demonstrating a capability to map complex structure-property relationships. CrystaLLM-π provides a unified, flexible, and computationally efficient framework for inverse materials design.
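To make the idea of injecting continuous properties into attention more concrete, here is a minimal single-head sketch in plain Python. It is not the authors' implementation: the abstract does not specify how PKV Prefix or PKV Residual attention are built, so the function names (`project`, `prefix_attention`, `residual_attention`), the linear property projection, the single property slot, and the scalar `gate` are all assumptions chosen for illustration. The prefix variant prepends a property-derived key/value pair to the causal context; the residual variant adds a gated, separate attention over the property slots to the ordinary causal output.

```python
import math
import random


def project(weight_rows, props):
    """Linear map from a continuous property vector to a d-dimensional vector."""
    return [sum(w * p for w, p in zip(row, props)) for row in weight_rows]


def attend(query, keys, values):
    """Scaled dot-product attention for one query; returns (output, weights)."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(values[0])
    output = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim)]
    return output, weights


def prefix_attention(queries, keys, values, prop_key, prop_value):
    """Causal attention where every position also sees one property slot (assumed design)."""
    results = []
    for t, query in enumerate(queries):
        ctx_keys = [prop_key] + keys[: t + 1]
        ctx_values = [prop_value] + values[: t + 1]
        results.append(attend(query, ctx_keys, ctx_values))
    return results


def residual_attention(queries, keys, values, prop_keys, prop_values, gate):
    """Causal attention plus a gated attention over property slots (assumed design)."""
    outputs = []
    for t, query in enumerate(queries):
        base, _ = attend(query, keys[: t + 1], values[: t + 1])
        prop, _ = attend(query, prop_keys, prop_values)
        outputs.append([b + gate * p for b, p in zip(base, prop)])
    return outputs


def rand_vec(n):
    return [random.uniform(-1.0, 1.0) for _ in range(n)]


random.seed(0)
d_model, seq_len = 4, 3
queries = [rand_vec(d_model) for _ in range(seq_len)]
keys = [rand_vec(d_model) for _ in range(seq_len)]
values = [rand_vec(d_model) for _ in range(seq_len)]

props = [1.4, -0.3]  # hypothetical normalised property values, not from the paper
w_key = [rand_vec(len(props)) for _ in range(d_model)]
w_value = [rand_vec(len(props)) for _ in range(d_model)]
prop_key = project(w_key, props)
prop_value = project(w_value, props)

prefix_out = prefix_attention(queries, keys, values, prop_key, prop_value)
plain_out = residual_attention(queries, keys, values, [prop_key], [prop_value], gate=0.0)
gated_out = residual_attention(queries, keys, values, [prop_key], [prop_value], gate=0.5)
```

In this sketch, a gate of zero makes the residual variant reproduce ordinary causal attention exactly. That is one plausible way a pre-trained model's behaviour could be left untouched at the start of fine-tuning, but whether CrystaLLM-π uses such a gate is not stated in the abstract.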
Forward citations
Cited by 2 Pith papers
- Conditional Generative Models Enable Targeted Exploration of MAX Phase Design Space
  Conditional generative models double the rate of stable novel MAX phase structures by steering generation with MXene derivative counts and A-site binding energy surrogates, yielding five DFT-stable candidates out of t...
- PRISMat: Policy-Driven, Permutation-Invariant Autoregressive Material Generation
  PRISMat generates crystal slabs with mean absolute errors of 0.188 eV/Å² for cleavage energy and 2.79 eV for work function, reducing error by 4× versus the next best model while using less inference time.