
arxiv: 2509.07150 · v4 · pith:MVC2DGCVnew · submitted 2025-09-08 · 💻 cs.LG · cond-mat.mtrl-sci

PLaID++: A Preference Aligned Language Model for Targeted Inorganic Materials Design

keywords materials, plaid, constraints, design, encourages, generates, generation, introduce

Reinforcement Learning from Verifiable Rewards (RLVR) has emerged as a promising approach to improve correctness in LLMs. However, in many scientific problems the objective is not necessarily to produce the correct answer, but to produce a diverse array of candidates that satisfy a set of constraints. We study this challenge in the context of materials generation. To this end, we introduce PLaID++, an LLM post-trained for stable and property-guided crystal generation. We find that performance hinges on our crystallographic representation and reward formulation. First, we introduce a compact, symmetry-informed Wyckoff text representation that improves computational efficiency and encourages generalization from physical priors. Second, we demonstrate that temperature scaling acts as an entropy regularizer that counteracts mode collapse and encourages exploration. By encoding symmetry constraints directly into text and guiding model outputs towards desirable chemical space, PLaID++ generates structures that are thermodynamically stable, unique, and novel at a roughly 50% greater rate than prior methods, and conditionally generates structures with desired space group properties. Our work demonstrates the potential of adapting post-training techniques from natural language processing to materials design, paving the way for targeted and efficient discovery of novel materials.
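
The abstract's claim that temperature scaling acts as an entropy regularizer can be illustrated with a small sketch. This is not the authors' training code; the abstract does not say how temperature enters their sampling or reward pipeline. The sketch only shows the general property that dividing logits by a larger temperature before the softmax yields a higher-entropy, more exploratory distribution. The logits and the function names `softmax_with_temperature` and `shannon_entropy` are hypothetical, chosen for illustration.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities after dividing by a sampling temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def shannon_entropy(probs):
    """Shannon entropy in nats; zero-probability entries contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0)


# Hypothetical next-token logits (an assumption, not taken from the paper).
logits = [4.0, 2.0, 1.0, 0.5]

# Entropy of the sampling distribution at a few temperatures.
entropies = {
    t: shannon_entropy(softmax_with_temperature(logits, t))
    for t in (0.5, 1.0, 2.0)
}

for t, h in entropies.items():
    print(f"temperature={t}: entropy={h:.3f} nats")
```

Under these assumptions the entropy rises with temperature and stays below the uniform-distribution bound of log(4) nats, which is the sense in which a higher temperature discourages collapse onto a single mode.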



Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. CrystalReasoner: Reasoning and RL for Property-Conditioned Crystal Structure Generation

    cs.AI 2026-05 unverdicted novelty 6.0

    CrystalReasoner combines LLM reasoning traces with physical priors and multi-objective RL to generate valid, stable, and property-conditioned crystal structures.

  2. Generative design of inorganic materials

    cond-mat.mtrl-sci 2026-04 unverdicted novelty 3.0

    A perspective advocating an integrated foundation AI model for inorganic materials that connects generative design, multi-modal databases, and experimental validation to address data-driven inverse design challenges.