CatDT deploys a self-evolving multi-agent system with UniMech and reinforcement learning to build digital twins of heterogeneous catalysts, matching experimental rates within 0.5-2x on seven benchmarks and identifying competitive non-precious candidates for propane dehydrogenation.
hub Mixed citations
Uma: A family of universal models for atoms
Mixed citation behavior. Most common role is background (60%).
hub tools
citation-role summary
citation-polarity summary
representative citing papers
An E(3)-equivariant deep RL framework lets an O2 agent discover kinetically plausible diffusion and dissociation pathways in disordered Si/a-SiO2 without hand-crafted reaction coordinates or collective variables.
Kernels from pretrained MLIP latent spaces outperform standard acquisition methods in active learning for reactive chemistry, reducing required labels by 38% for energy error and 28% for force error.
PROBE recasts MLIP uncertainty quantification as selective classification by training a compact discriminative classifier on frozen per-atom backbone embeddings, yielding a reliability probability that tracks actual error better than ensemble disagreement.
A closed-loop framework jointly optimizes molecular composition and geometry in multi-component systems, demonstrated by a 30% reduction in activation barrier for a Claisen rearrangement via post-hoc validation.
SOAP and SOAP-Muon optimizers deliver faster convergence and higher final accuracy than Adam for NequIP and Allegro MLIPs, with the largest gains under partial force supervision.
Torched-TACAW enables efficient large-scale STEM-EELS simulations of vibrational and magnon excitations in defective materials by combining ML-driven molecular dynamics with supercell partitioning and on-the-fly multislice processing.
CoTAR reconstructs molecular topology from atomic positions in non-reactive condensed-phase systems using a GNN-HMM hybrid, reporting 0.906 F1 on classical MD data and raising uMLIP valid-snapshot rate from 38.6% to 84.7% after few-shot fine-tuning.
SAP-derived features feed an orbital-based graph neural network that predicts Kohn-Sham Fock matrices, reproducing orbital energies, dipoles, density of states on QM9 and transfer integrals on organic dimers with 4.8 meV MAE on unseen cases.
Distilled compact MLIPs from transfer-learned teachers reproduce observables more reliably than same-size models trained directly and enable practical PIMD umbrella sampling of water dissociation at TiO2 interface with NQE effects matching NMR.
Physics-informed distillation from a universal MLIP plus limited CCSD(T) fine-tuning yields cm^{-1} accurate potentials for non-covalent interactions, with teacher choice strongly affecting accuracy on some systems.
Clari, a unit-cell flow matching model with pair-bias attention, generates organic crystal structures faster than OXtal while improving solve rates and supporting energy-based ranking without relaxation.
Multitask learning on linear-scaling GFN1-xTB orbital charges cuts energy MAE by 46% and data needs by 5x versus energy-only MLIPs while outperforming DFT atomic charge augmentation.
FMwC computes per-sample confidence scores for flow matching models via closed-form propagation of input-dependent multiplicative noise variance along the sampling ODE, supporting filtering, editing, and adaptive stepping.
A reweighting method with mean energy-gap approximation transfers PMFs between MLIPs to recover target reaction and activation free energies at low cost for a 601-atom Li+ transport system across DFT levels.
TSAgent automates transition state searches at DFT accuracy via an agentic loop, reaching 83% success on 100 OC20NEB examples and 70% on 10 held-out cases versus 73% for human experts.
Force-aware Neural Tangent Kernels combined with chunked acquisition provide scalable and distribution-robust active learning for MLIPs, outperforming baselines on OC20 and remaining competitive on other benchmarks.
A new benchmark finds that state-of-the-art ML interatomic potentials struggle with compositional generalization, producing errors an order of magnitude higher on unseen molecular combinations than on training-like cases.
Structural pruning of SO(3) equivariant atomistic models from large checkpoints yields 1.5-4x fewer parameters and 2.5-4x less pre-training compute than small models trained from scratch, while outperforming them on most Matbench Discovery metrics and downstream tasks.
VibroML automates remediation of dynamic instabilities in crystalline materials by combining MLIPs with genetic algorithms for polymorph search, finite-temperature MD validation, and compositional alloying to yield stable structures from databases like Alexandria.
PET-UAFD ensemble of ML potentials, calibrated on experimental cohesive energies and moduli, matches experimental accuracy on liquid properties and supplies uncertainty estimates via the PET-EXP protocol.
Suiren-1.0 is a family of three molecular foundation models (Base, Dimer, ConfAvg) pre-trained on 70M+ DFT samples and distilled to achieve claimed state-of-the-art performance on quantum property prediction tasks from 2D inputs.
UBio-MolFM achieves ab initio-level fidelity on large out-of-distribution biomolecular systems using a new multi-fidelity dataset, E2Former-V2 architecture, and three-stage curriculum learning.
Systematic tests show naive fine-tuning excels for single-task accuracy while multihead replay best preserves out-of-distribution robustness in MLIP adaptation.
citing papers explorer
-
Pretrained Model Representations as Acquisition Signals for Active Learning of MLIPs
Kernels from pretrained MLIP latent spaces outperform standard acquisition methods in active learning for reactive chemistry, reducing required labels by 38% for energy error and 28% for force error.
-
Knowing when to trust machine-learned interatomic potentials
PROBE recasts MLIP uncertainty quantification as selective classification by training a compact discriminative classifier on frozen per-atom backbone embeddings, yielding a reliability probability that tracks actual error better than ensemble disagreement.
-
Beyond Adam: SOAP and Muon for Faster, Label-Efficient Training of Machine Learning Interatomic Potentials
SOAP and SOAP-Muon optimizers deliver faster convergence and higher final accuracy than Adam for NequIP and Allegro MLIPs, with the largest gains under partial force supervision.
-
Fast Organic Crystal Structure Prediction with Unit Cell Flow Matching
Clari, a unit-cell flow matching model with pair-bias attention, generates organic crystal structures faster than OXtal while improving solve rates and supporting energy-based ranking without relaxation.
-
Force-Aware Neural Tangent Kernels for Scalable and Robust Active Learning of MLIPs
Force-aware Neural Tangent Kernels combined with chunked acquisition provide scalable and distribution-robust active learning for MLIPs, outperforming baselines on OC20 and remaining competitive on other benchmarks.
-
Benchmarking Compositional Generalisation for Machine Learning Interatomic Potentials
A new benchmark finds that state-of-the-art ML interatomic potentials struggle with compositional generalization, producing errors an order of magnitude higher on unseen molecular combinations than on training-like cases.
-
Compact SO(3) Equivariant Atomistic Foundation Models via Structural Pruning
Structural pruning of SO(3) equivariant atomistic models from large checkpoints yields 1.5-4x fewer parameters and 2.5-4x less pre-training compute than small models trained from scratch, while outperforming them on most Matbench Discovery metrics and downstream tasks.