hub Mixed citations

Open Materials 2024 (OMat24) Inorganic Materials Dataset and Models

Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Brandon M. Wood, Misko Dzamba, Meng Gao · 2024 · cond-mat.mtrl-sci · arXiv 2410.12771

Mixed citation behavior. Most common role is background (56%).

32 Pith papers citing it

Background 56% of classified citations

open full Pith review browse 32 citing papers arXiv PDF

abstract

The ability to discover new materials with desirable properties is critical for numerous applications from helping mitigate climate change to advances in next generation computing hardware. AI has the potential to accelerate materials discovery and design by more effectively exploring the chemical space compared to other computational methods or by trial-and-error. While substantial progress has been made on AI for materials data, benchmarks, and models, a barrier that has emerged is the lack of publicly available training data and open pre-trained models. To address this, we present a Meta FAIR release of the Open Materials 2024 (OMat24) large-scale open dataset and an accompanying set of pre-trained models. OMat24 contains over 110 million density functional theory (DFT) calculations focused on structural and compositional diversity. Our EquiformerV2 models achieve state-of-the-art performance on the Matbench Discovery leaderboard and are capable of predicting ground-state stability and formation energies to an F1 score above 0.9 and an accuracy of 20 meV/atom, respectively. We explore the impact of model size, auxiliary denoising objectives, and fine-tuning on performance across a range of datasets including OMat24, MPtraj, and Alexandria. The open release of the OMat24 dataset and models enables the research community to build upon our efforts and drive further advancements in AI-assisted materials science.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 5 dataset 3 other 1

citation-polarity summary

background 5 use dataset 3 unclear 1

representative citing papers

SLayerGen: a Crystal Generative Model for all Space and Layer Groups

cond-mat.mtrl-sci · 2026-05-07 · unverdicted · novelty 8.0

SLayerGen generates crystals invariant to any space or layer group via autoregressive lattice and Wyckoff sampling plus equivariant diffusion, achieving gains over bulk models on diperiodic materials after correcting a prior loss inconsistency for hexagonal groups.

JanusPipe: Efficient Pipeline Parallel Training for Machine Learning Interatomic Potentials

cs.DC · 2026-05-18 · unverdicted · novelty 7.0 · 2 refs

JanusPipe introduces SymFold and WaveK to enable efficient 3D-parallel training for conservative MLIPs, reporting 1.51x and 1.45x average throughput gains over 1F1B and Hanayo baselines on 32 GPUs.

Lang2MLIP: End-to-End Language-to-Machine Learning Interatomic Potential Development with Autonomous Agentic Workflows

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

Lang2MLIP is an LLM multi-agent framework that automates end-to-end development of machine learning interatomic potentials from natural language input for heterogeneous materials systems.

Breaking the Training Barrier of Billion-Parameter Universal Machine Learning Interatomic Potentials

cs.DC · 2026-04-17 · conditional · novelty 7.0

MatRIS-MoE and Janus enable efficient exascale training of billion-parameter universal interatomic potentials by addressing second-order derivative computation and communication overheads.

Atomistic Machine Learning with Irreducible Cartesian Natural Tensors

cond-mat.mtrl-sci · 2025-10-05 · unverdicted · novelty 7.0

CarNet develops irreducible Cartesian natural tensors and an equivariant model that matches leading spherical-tensor performance for ML interatomic potentials and high-rank tensor predictions like elastic constants.

Teachers that teach the irrelevant: Pre-training machine learned interaction potentials with classical force fields for robust molecular dynamics simulations

physics.chem-ph · 2025-09-17 · unverdicted · novelty 7.0

Pre-training ML interaction potentials on classical force fields followed by ab initio fine-tuning produces stable and accurate molecular dynamics simulations for gas-phase molecules, liquid water, and hydrogen combustion.

Can MACE Potentials Accurately Describe Magnetism and Phase Stability in Fe-Ni Alloys? A Systematic Benchmark

cond-mat.mtrl-sci · 2026-05-27 · accept · novelty 6.0

A system-specific MACE-sqs model trained on spin-polarized PBE DFT data for Fe-Ni SQS structures outperforms foundation models for equations of state, volumes, elastic constants and thermal expansion but all models incorrectly increase bcc-to-hcp transition pressure with Ni content.

CrystalREPA: Transferring Physical Priors from Universal MLIPs to Crystal Generative Models

cond-mat.mtrl-sci · 2026-05-09 · unverdicted · novelty 6.0

CrystalREPA closes the representation gap between crystal generators and universal MLIPs via contrastive alignment, yielding more stable and valid generated crystals while revealing that MLIP teacher quality is better predicted by representation distinguishability than by leaderboard accuracy.

Compact SO(3) Equivariant Atomistic Foundation Models via Structural Pruning

cs.LG · 2026-05-09 · unverdicted · novelty 6.0

Structural pruning of SO(3) equivariant atomistic models from large checkpoints yields 1.5-4x fewer parameters and 2.5-4x less pre-training compute than small models trained from scratch, while outperforming them on most Matbench Discovery metrics and downstream tasks.

Density diversity in training data governs thermodynamic transferability of machine learning interatomic potentials

physics.chem-ph · 2026-05-07 · unverdicted · novelty 6.0

Density diversity in training data is the key factor for making machine learning interatomic potentials transferable across thermodynamic states, outperforming temperature diversity.

VibroML: an automated toolkit for high-throughput vibrational analysis and dynamic instability remediation of crystalline materials using machine-learned potentials

cond-mat.mtrl-sci · 2026-04-30 · unverdicted · novelty 6.0

VibroML automates remediation of dynamic instabilities in crystalline materials by combining MLIPs with genetic algorithms for polymorph search, finite-temperature MD validation, and compositional alloying to yield stable structures from databases like Alexandria.

Errors that matter: Uncertainty-aware universal machine-learning potentials calibrated on experiments

physics.chem-ph · 2026-04-27 · conditional · novelty 6.0

PET-UAFD ensemble of ML potentials, calibrated on experimental cohesive energies and moduli, matches experimental accuracy on liquid properties and supplies uncertainty estimates via the PET-EXP protocol.

Agentic Fusion of Large Atomic and Language Models to Accelerate Superconductor Discovery

cs.LG · 2026-04-26 · unverdicted · novelty 6.0

An agentic framework fusing large atomic and language models rediscovers 66 known superconductors and guides experimental verification of four new ones with transition temperatures from 2.5 K to 6.5 K.

AI-Driven Expansion and Application of the Alexandria Database

cond-mat.mtrl-sci · 2025-12-09 · accept · novelty 6.0

A combined generative model, ML potential, and graph neural network pipeline expands the Alexandria database by 1.3 million DFT-validated compounds with 99% success near the convex hull and releases training data for universal force fields.

An experimentally validated end-to-end framework for operando modeling of intrinsically complex metallosilicates

cond-mat.mtrl-sci · 2025-12-02 · conditional · novelty 6.0

An end-to-end framework combining domain separation, lightweight ML potentials, and de novo in silico synthesis enables quantitative atomistic modeling of mesoporous metallosilicates that matches experimental densities, pair distribution functions, IR spectra, and hydroxyl densities.

Machine Learning Phonon Spectra for Fast and Accurate Optical Lineshapes of Defects

cond-mat.mtrl-sci · 2025-08-12 · unverdicted · novelty 6.0

Machine learning interatomic potentials fine-tuned on first-principles relaxation data accurately reproduce phonon spectra and optical lineshapes for defects, matching explicit calculations and experiments.

Universal Interatomic Potentials as Configuration-Space Generators for One-Shot and Iterative Fine-Tuning of Ab Initio-Accurate Material-Specific Models

cond-mat.mtrl-sci · 2026-06-22 · unverdicted · novelty 5.0

Universal MLIPs serve as configuration generators whose DFT-relabeled subsamples enable one-shot or iterative training of material-specific MLIPs that recover accurate reactive energy profiles with 600-2000 DFT calculations.

Systematic Fine-Tuning of MACE Interatomic Potentials for Catalysis

physics.chem-ph · 2026-05-10 · conditional · novelty 5.0

Fine-tuned MACE MLIPs achieve lower mean absolute errors on catalytic reaction energies and barriers than from-scratch models, with a large fine-tuned model performing best on both metallic and oxide systems including out-of-distribution cases.

MatterSim-MT: A multi-task foundation model for in silico materials characterization

cond-mat.mtrl-sci · 2026-05-08 · unverdicted · novelty 5.0 · 2 refs

MatterSim-MT is a multi-task ML foundation model pretrained on 35M+ structures for in silico materials property prediction and complex simulations.

OptiMat Alloys: a FAIR, living database of multi-principal element alloys enabled by a conversational agent

cond-mat.mtrl-sci · 2026-04-23 · unverdicted · novelty 5.0

OptiMat Alloys is a conversational AI system that maintains a living FAIR database of multi-principal element alloy calculations and enables natural-language, on-demand computations with built-in uncertainty checks.

Accuracy and Efficiency Benchmarks of Pretrained Machine Learning Potentials for Molecular Simulations

physics.chem-ph · 2026-01-22 · unverdicted · novelty 5.0

Benchmarks of 15 MLIPs show parameter count and training set size correlate with accuracy, architecture drives speed and memory, and explicit Coulomb terms provide no benefit.

Comparing the latent features of universal machine-learning interatomic potentials

physics.chem-ph · 2025-12-05 · unverdicted · novelty 5.0

Different uMLIPs encode chemical space in distinct ways, with high cross-model feature reconstruction errors, and fine-tuning preserves strong pre-training bias in the latent features.

Tailored Vapor Deposition Unlocks Large-Grain, Wafer-Scale Epitaxial Growth of 2D Magnetic CrCl3

cond-mat.mtrl-sci · 2025-05-22 · unverdicted · novelty 5.0

Centimeter-scale epitaxial growth of phase-pure crystalline 2D CrCl3 films achieved on mica via controlled physical vapor transport with innovations in light management, high carrier-gas flow, and moisture control.

Fine-Tuning a Universal Machine-Learned Interatomic Potential for Oxygen Plasma Interactions with WS$_2$

cond-mat.mtrl-sci · 2026-06-19 · unverdicted · novelty 4.0

Pretrained UMA model reproduces chemisorbed S and O coverage under 15 eV O+ and O2+ bombardment on WS2 without fine-tuning; fine-tuning lowers energy MAE to 4.5e-3 eV/atom and force MAE to 0.076 eV/Å.

citing papers explorer

Showing 5 of 5 citing papers after filters.

Lang2MLIP: End-to-End Language-to-Machine Learning Interatomic Potential Development with Autonomous Agentic Workflows cs.LG · 2026-05-14 · unverdicted · none · ref 79 · internal anchor
Lang2MLIP is an LLM multi-agent framework that automates end-to-end development of machine learning interatomic potentials from natural language input for heterogeneous materials systems.
Compact SO(3) Equivariant Atomistic Foundation Models via Structural Pruning cs.LG · 2026-05-09 · unverdicted · none · ref 32 · internal anchor
Structural pruning of SO(3) equivariant atomistic models from large checkpoints yields 1.5-4x fewer parameters and 2.5-4x less pre-training compute than small models trained from scratch, while outperforming them on most Matbench Discovery metrics and downstream tasks.
Agentic Fusion of Large Atomic and Language Models to Accelerate Superconductor Discovery cs.LG · 2026-04-26 · unverdicted · none · ref 89 · internal anchor
An agentic framework fusing large atomic and language models rediscovers 66 known superconductors and guides experimental verification of four new ones with transition temperatures from 2.5 K to 6.5 K.
An experimentally validated end-to-end framework for operando modeling of intrinsically complex metallosilicates cond-mat.mtrl-sci · 2025-12-02 · conditional · none · ref 58 · internal anchor
An end-to-end framework combining domain separation, lightweight ML potentials, and de novo in silico synthesis enables quantitative atomistic modeling of mesoporous metallosilicates that matches experimental densities, pair distribution functions, IR spectra, and hydroxyl densities.
MatterSim-MT: A multi-task foundation model for in silico materials characterization cond-mat.mtrl-sci · 2026-05-08 · unverdicted · none · ref 19 · 2 links · internal anchor
MatterSim-MT is a multi-task ML foundation model pretrained on 35M+ structures for in silico materials property prediction and complex simulations.

Open Materials 2024 (OMat24) Inorganic Materials Dataset and Models

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer