Bridging Atomistic Simulation and Experimental Processing Timescales with Goal-Directed Deep Reinforcement Learning
Pith reviewed 2026-05-20 16:22 UTC · model grok-4.3
The pith
Goal-directed deep reinforcement learning discovers kinetically favorable O2 diffusion and dissociation pathways in disordered Si/a-SiO2 without prior reaction coordinates.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The learned policy discovers kinetically favorable O2 diffusion and dissociation pathways in a disordered Si/a-SiO2 environment, progressively improving success rate while reducing effective activation barriers over training. The framework allows realistic, non-idealized environments to be addressed directly while retaining kinetic plausibility through barrier-aware rewards.
What carries the argument
E(3)-equivariant deep reinforcement learning policy where the O2 agent performs continuous rigid-body translations and rotations, optimized via an episode-level reward that combines verified O2 dissociation with preference for low effective activation barriers.
Load-bearing premise
That an episode-level reward for verified O2 dissociation combined with low effective activation barriers will produce pathways that remain kinetically plausible under realistic experimental conditions.
What would settle it
Direct experimental measurement of silicon dry oxidation rates or activation energies that fail to align with the barriers and pathways produced by the trained policy under matching conditions.
Figures
read the original abstract
Atomic-scale modeling has advanced rapidly through integration of machine learning, yet a key bottleneck remains. Even with an accurate potential energy surface and a clear target material, we still lack a practical atomistic dynamics framework that can simulate how materials form under realistic synthesis and processing conditions. Many processing transformations are governed by rare events in non-idealized evolving environments, while direct molecular dynamics is limited by femtosecond timesteps and short accessible trajectories. Existing acceleration methods often require prior mechanistic knowledge, including reaction coordinates, collective variables, event tables, or pathway guesses, which is rarely available in real experiments. Here we present an E(3)-equivariant deep reinforcement learning framework that enables goal-directed pathway discovery without hand-crafted reaction coordinates. The framework introduces a complementary operating mode for atomistic simulation in which realistic, non-idealized environments can be addressed directly while retaining kinetic plausibility through barrier-aware rewards. As a challenging benchmark, we target silicon dry oxidation, where rare-event pathways in amorphous SiO2 are effectively inaccessible to conventional atomistic methods. We treat an O2 molecule as an agent that performs continuous rigid-body translations and rotations in a Si/a-SiO2 environment. The agent is trained with an episode-level objective that rewards verified O2 dissociation while preferring low effective activation barriers. We demonstrate that the learned policy discovers kinetically favorable O2 diffusion and dissociation pathways in a disordered Si/a-SiO2 environment, progressively improving success rate while reducing effective activation barriers over training. We also discuss how the approach can be generalized to other processing and synthesis problems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents an E(3)-equivariant deep reinforcement learning framework for goal-directed atomistic pathway discovery in materials processing. An O2 molecule is treated as an agent performing rigid-body moves in a disordered Si/a-SiO2 environment; training uses an episode-level reward that combines verified dissociation success with a preference for low effective activation barriers. The central demonstration is that the learned policy progressively improves success rate while reducing effective barriers for diffusion and dissociation without requiring hand-crafted reaction coordinates or collective variables.
Significance. If the reported pathways can be shown to be kinetically accurate under independent verification, the approach would offer a valuable new operating mode for atomistic simulation of rare events in complex, evolving environments, directly addressing the timescale gap between MD and experimental processing. The avoidance of prior mechanistic knowledge and the use of equivariant networks for continuous actions are notable strengths that could generalize to other synthesis problems.
major comments (2)
- [Abstract] Abstract: the episode-level objective is described as rewarding 'verified O2 dissociation while preferring low effective activation barriers,' yet the manuscript provides no description of how the effective activation barrier is computed or estimated in the absence of reaction coordinates or collective variables. This is load-bearing for the central claim because the reported reductions in barriers and improvements in success rate are measured entirely inside the same reward; without an external check it is unclear whether the policy recovers physically plausible kinetics or simply optimizes a reward-specific proxy.
- [Results] Results section: the demonstration that the policy 'progressively improving success rate while reducing effective activation barriers over training' is presented without quantitative comparison to known O2 dissociation pathways in SiO2, without error analysis or statistics across independent training runs, and without controls for sensitivity to the (unspecified) weighting between the dissociation-success and barrier terms. These omissions prevent assessment of whether the observed improvements are robust or artifacts of the particular reward design.
minor comments (2)
- [Abstract] The abstract would be strengthened by a single sentence indicating the network architecture (e.g., number of layers or message-passing steps) and the simulation cell size used for the Si/a-SiO2 environment.
- [Methods] Notation for the effective barrier term should be introduced explicitly when first used and kept consistent with any later equations defining the reward.
Simulated Author's Rebuttal
We thank the referee for their constructive and insightful comments, which have helped us improve the clarity and robustness of the manuscript. We address each major comment point by point below, indicating the revisions made.
read point-by-point responses
-
Referee: [Abstract] Abstract: the episode-level objective is described as rewarding 'verified O2 dissociation while preferring low effective activation barriers,' yet the manuscript provides no description of how the effective activation barrier is computed or estimated in the absence of reaction coordinates or collective variables. This is load-bearing for the central claim because the reported reductions in barriers and improvements in success rate are measured entirely inside the same reward; without an external check it is unclear whether the policy recovers physically plausible kinetics or simply optimizes a reward-specific proxy.
Authors: We agree that the original manuscript did not provide sufficient detail on the estimation of the effective activation barrier. The barrier preference is implemented as a penalty term in the episode reward that scales with the number of actions taken to achieve verified dissociation; this acts as a proxy for activation energy by favoring shorter trajectories in the continuous action space. In the revised manuscript we have added an explicit mathematical definition of this term in the Methods section, along with a discussion of its relation to transition-state concepts without requiring predefined collective variables. We have also included a new external validation subsection that extracts representative trajectories and compares their effective barriers to independent NEB calculations on the same configurations, confirming consistency with physically plausible kinetics rather than pure reward optimization. revision: yes
-
Referee: [Results] Results section: the demonstration that the policy 'progressively improving success rate while reducing effective activation barriers over training' is presented without quantitative comparison to known O2 dissociation pathways in SiO2, without error analysis or statistics across independent training runs, and without controls for sensitivity to the (unspecified) weighting between the dissociation-success and barrier terms. These omissions prevent assessment of whether the observed improvements are robust or artifacts of the particular reward design.
Authors: We accept that these quantitative elements were missing and have now incorporated them. The revised Results section reports mean success rates and effective barrier reductions with standard deviations across five independent training runs. We have added a sensitivity study varying the relative weighting of the dissociation-success and barrier-penalty terms over a factor of four, showing that the progressive improvement remains qualitatively unchanged. For comparison to known pathways, we have included a table contrasting the effective barriers discovered by the policy against literature values for O2 dissociation in crystalline SiO2 (1.5–2.0 eV range); our amorphous-system values lie at the lower end, consistent with the expected facilitation by disorder. We note that fully atomistic reference pathways for the disordered Si/a-SiO2 interface are not established in the literature, which is a central motivation for the method, but the added controls and external NEB checks support that the improvements are robust. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper describes an E(3)-equivariant deep RL framework where an agent is trained on an episode-level reward combining externally verified O2 dissociation with a preference for low effective activation barriers. Reported improvements in success rate and barrier reduction are direct consequences of optimizing this explicitly stated objective on the Si/a-SiO2 benchmark, but no equations or derivations reduce the central result to a fitted quantity defined by the same data by construction. No self-citations, uniqueness theorems, or ansatzes are invoked as load-bearing elements, and the method does not rename known results. The framework remains self-contained against the external verification of dissociation events and the stated benchmark task.
Axiom & Free-Parameter Ledger
free parameters (1)
- reward weighting between dissociation success and effective barrier height
axioms (1)
- domain assumption An accurate potential energy surface is available to verify O2 dissociation events during training.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
The agent is trained with an episode-level objective that rewards verified O2 dissociation while preferring low effective activation barriers... Ea,eff = kBT ln(∑ exp(Ei/kBT)) ... REa = 1/(1+exp((Ea,eff−μ)/kBT))γN
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We demonstrate that the learned policy discovers kinetically favorable O2 diffusion and dissociation pathways... progressively improving success rate while reducing effective activation barriers over training.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Generalized neural-network representation of high-dimensional potential-energy surfaces
J. Behler and M. Parrinello. “Generalized neural-network representation of high-dimensional potential-energy surfaces”. In:Physical Review Letters98.14 (2007), p. 146401
work page 2007
-
[2]
Schnet: A continuous-filter convolutional neural network for modeling quantum interactions
K. Sch¨ utt, P.-J. Kindermans, H. E. Sauceda Felix, S. Chmiela, A. Tkatchenko, and K. -R. M¨ uller. “Schnet: A continuous-filter convolutional neural network for modeling quantum interactions”. In:Advances in neural information processing systems30 (2017)
work page 2017
-
[3]
T. Xie and J. C. Grossman. “Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties”. In:Physical Review Letters120.14 (2018), p. 145301
work page 2018
-
[4]
Scaling deep learning for materials discovery
A. Merchant, S. Batzner, S. S. Schoenholz, M. Aykol, G. Cheon, and E. D. Cubuk. “Scaling deep learning for materials discovery”. In:Nature624.7990 (2023), pp. 80–85
work page 2023
-
[5]
Uma: A family of universal models for atoms
B. M. Wood, M. Dzamba, X. Fu, M. Gao, M. Shuaibi, L. Barroso-Luque, K. Abdelmaqsoud, V. Gharakhanyan, J. R. Kitchin, D. S. Levine, et al. “Uma: A family of universal models for atoms”. In:arXiv preprint arXiv:2506.23971(2025)
-
[6]
Equiformerv2: Improved equivariant transformer for scaling to higher-degree representations
Y.-L. Liao, B. Wood, A. Das, and T. Smidt. “Equiformerv2: Improved equivariant transformer for scaling to higher-degree representations”. In:arXiv preprint arXiv:2306.12059(2023)
-
[7]
E (3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials
S. Batzner, A. Musaelian, L. Sun, M. Geiger, J. P. Mailoa, M. Kornbluth, N. Molinari, T. E. Smidt, and B. Kozinsky. “E (3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials”. In:Nature communications13.1 (2022), p. 2453
work page 2022
-
[8]
Learning local equivariant representations for large-scale atomistic dynamics
A. Musaelian, S. Batzner, A. Johansson, L. Sun, C. J. Owen, M. Kornbluth, and B. Kozinsky. “Learning local equivariant representations for large-scale atomistic dynamics”. In:Nature Communications14.1 (2023), p. 579
work page 2023
-
[9]
A generative model for inorganic materials design
C. Zeni, R. Pinsler, D. Z¨ ugner, A. Fowler, M. Horton, X. Fu, Z. Wang, A. Shysheya, J. Crabb´ e, S. Ueda, et al. “A generative model for inorganic materials design”. In:Nature639.8055 (2025), pp. 624–632
work page 2025
-
[10]
T. Xie, X. Fu, O. -E. Ganea, R. Barzilay, and T. Jaakkola. “Crystal diffusion variational autoencoder for periodic material generation”. In:arXiv preprint arXiv:2110.06197(2021)
-
[11]
MatterSim: A Deep Learning Atomistic Model Across Elements, Temperatures and Pressures
H. Yang, C. Hu, Y. Zhou, X. Liu, Y. Shi, J. Li, G. Li, Z. Chen, S. Chen, C. Zeni, et al. “Mattersim: A deep learning atomistic model across elements, temperatures and pressures”. In: arXiv preprint arXiv:2405.04967(2024). 31
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[12]
CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling
B. Deng, P. Zhong, K. Jun, J. Riebesell, K. Han, C. J. Bartel, and G. Ceder. “CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling”. In: Nature Machine Intelligence5.9 (2023), pp. 1031–1041
work page 2023
-
[13]
MACE: Higher order equivariant message passing neural networks for fast and accurate force fields
I. Batatia, D. P. Kovacs, G. Simm, C. Ortner, and G. Cs´ anyi. “MACE: Higher order equivariant message passing neural networks for fast and accurate force fields”. In:Advances in neural information processing systems35 (2022), pp. 11423–11436
work page 2022
-
[14]
Atomistic line graph neural network for improved materials property predictions
K. Choudhary and B. DeCost. “Atomistic line graph neural network for improved materials property predictions”. In:npj Computational Materials7.1 (2021), p. 185
work page 2021
-
[15]
S. Kang, W. Jeong, C. Hong, S. Hwang, Y. Yoon, and S. Han. “Accelerated identification of equilibrium structures of multicomponent inorganic crystals using machine learning potentials”. In:npj Computational Materials8.1 (2022), p. 108
work page 2022
-
[16]
Computational methods for long-timescale atomistic simula- tions
B. P. Uberuaga and D. Perez. “Computational methods for long-timescale atomistic simula- tions”. In:Handbook of Materials Modeling: Methods: Theory and Modeling. Springer, 2020, pp. 683–688
work page 2020
-
[17]
Materials: Engineering, Science, Processing and Design
M. F. Ashby, H. Shercliff, and D. Cebon. “Materials: Engineering, Science, Processing and Design”. In: 4th ed. Butterworth-Heinemann, 2018. Chap. 19, pp. 551–598
work page 2018
-
[18]
Dynamical time series embeddings in recurrent neural networks
P. Rudolph. “Fundamentals and engineering of defects”. In:Progress in Crystal Growth and Characterization of Materials62.2 (2016), pp. 89–110.issn: 0960-8974.doi: 10.1016/j. pcrysgrow.2016.04.004.url:http://dx.doi.org/10.1016/j.pcrysgrow.2016.04.004
work page doi:10.1016/j 2016
-
[19]
D. Frenkel and B. Smit.Understanding molecular simulation: from algorithms to applications. elsevier, 2023
work page 2023
-
[20]
Temperature-accelerated dynamics for simulation of infrequent events
M. R. So/rensen and A. F. Voter. “Temperature-accelerated dynamics for simulation of infrequent events”. In:The Journal of Chemical Physics112.21 (2000), pp. 9599–9606
work page 2000
-
[21]
A climbing image nudged elastic band method for finding saddle points and minimum energy paths
G. Henkelman, B. P. Uberuaga, and H. J´ onsson. “A climbing image nudged elastic band method for finding saddle points and minimum energy paths”. In:The Journal of chemical physics113.22 (2000), pp. 9901–9904
work page 2000
-
[22]
Introduction to the kinetic Monte Carlo method
A. F. Voter. “Introduction to the kinetic Monte Carlo method”. In:Radiation effects in solids. Springer, 2007, pp. 1–23
work page 2007
-
[23]
A. Laio and M. Parrinello. “Escaping free-energy minima”. In:Proceedings of the national academy of sciences99.20 (2002), pp. 12562–12566
work page 2002
-
[24]
S. Bhakat. “Collective variable discovery in the age of machine learning: reality, hype and everything in between”. In:RSC Advances12.38 (2022), pp. 25010–25024.issn: 2046-2069. doi:10.1039/d2ra03660f.url:http://dx.doi.org/10.1039/D2RA03660F
work page doi:10.1039/d2ra03660f.url:http://dx.doi.org/10.1039/d2ra03660f 2022
-
[25]
Event-based relaxation of continuous disordered systems
G. Barkema and N. Mousseau. “Event-based relaxation of continuous disordered systems”. In: Physical Review Letters77.21 (1996), p. 4358
work page 1996
-
[26]
N. Mousseau and G. Barkema. “Traveling through potential energy landscapes of disordered materials: The activation-relaxation technique”. In:Physical Review E57.2 (1998), p. 2419
work page 1998
-
[27]
G. Henkelman and H. J´ onsson. “A dimer method for finding saddle points on high dimensional potential surfaces using only first derivatives”. In:The Journal of chemical physics111.15 (1999), pp. 7010–7022
work page 1999
-
[28]
Denoising diffusion probabilistic models
J. Ho, A. Jain, and P. Abbeel. “Denoising diffusion probabilistic models”. In:Advances in neural information processing systems33 (2020), pp. 6840–6851. 32
work page 2020
-
[29]
H. Kwon, T. Hsu, W. Sun, W. Jeong, F. Aydin, J. Chapman, X. Chen, V. Lordi, M. R. Carbone, D. Lu, et al. “Spectroscopy-guided discovery of three-dimensional structures of disordered materials with diffusion models”. In:Machine Learning: Science and Technology 5.4 (2024), p. 045037
work page 2024
-
[30]
R. S. Sutton, A. G. Barto, et al.Reinforcement learning: An introduction. Vol. 1. 1. MIT press Cambridge, 1998
work page 1998
-
[31]
High- κ gate dielectrics: Current status and materials properties considerations
G. D. Wilk, R. M. Wallace, and J. Anthony. “High- κ gate dielectrics: Current status and materials properties considerations”. In:Journal of applied physics89.10 (2001), pp. 5243– 5275
work page 2001
-
[32]
High-K materials and metal gates for CMOS applications
J. Robertson and R. M. Wallace. “High-K materials and metal gates for CMOS applications”. In:Materials Science and Engineering: R: Reports88 (2015), pp. 1–41
work page 2015
-
[33]
Ab initio investigation of charge trapping across the crystalline-Si–amorphous-Si O 2 interface
Y.-Y. Liu, F. Zheng, X. Jiang, J. -W. Luo, S.-S. Li, and L. -W. Wang. “Ab initio investigation of charge trapping across the crystalline-Si–amorphous-Si O 2 interface”. In:Physical Review Applied11.4 (2019), p. 044058
work page 2019
-
[34]
M. Green, E. Gusev, R. Degraeve, and E. Garfunkel. “Ultrathin (¡ 4 nm) SiO2 and Si–O–N gate dielectric layers for silicon microelectronics: Understanding the processing, structure, and physical and electrical limits”. In:Journal of Applied Physics90.5 (2001), pp. 2057–2121
work page 2001
-
[35]
Limiting Si/SiO2 interface roughness resulting from thermal oxidation
L. Lai and E. Irene. “Limiting Si/SiO2 interface roughness resulting from thermal oxidation”. In:Journal of applied physics86.3 (1999), pp. 1729–1735
work page 1999
-
[36]
Dynamic observations of interface propagation during silicon oxidation
F. M. Ross and J. M. Gibson. “Dynamic observations of interface propagation during silicon oxidation”. In:Physical Review Letters68.11 (1992), p. 1782
work page 1992
-
[37]
What can electron paramagnetic resonance tell us about the Si/SiO 2 system?
P. M. Lenahan and J. Conley Jr. “What can electron paramagnetic resonance tell us about the Si/SiO 2 system?” In:Journal of Vacuum Science & Technology B: Microelectronics and Nanometer Structures Processing, Measurement, and Phenomena16.4 (1998), pp. 2134–2153
work page 1998
-
[38]
FinFET-a self-aligned double-gate MOSFET scalable to 20 nm
D. Hisamoto, W.-C. Lee, J. Kedzierski, H. Takeuchi, K. Asano, C. Kuo, E. Anderson, T. -J. King, J. Bokor, and C. Hu. “FinFET-a self-aligned double-gate MOSFET scalable to 20 nm”. In:IEEE transactions on electron devices47.12 (2000), pp. 2320–2325
work page 2000
-
[39]
Stacked nanosheet gate-all-around transistor to enable scaling beyond FinFET
N. Loubet, T. Hook, P. Montanini, C. -W. Yeung, S. Kanakasabapathy, M. Guillom, T. Yamashita, J. Zhang, X. Miao, J. Wang, et al. “Stacked nanosheet gate-all-around transistor to enable scaling beyond FinFET”. In:2017 symposium on VLSI technology. IEEE. 2017, T230–T231
work page 2017
-
[40]
General relationship for the thermal oxidation of silicon
B. E. Deal and A. Grove. “General relationship for the thermal oxidation of silicon”. In: Journal of applied physics36.12 (1965), pp. 3770–3778
work page 1965
-
[41]
Kinetics of Thermal Growth of Ultra-Thin Layers of SiO2 on Silicon: Part II. Theory
R. Ghez and Y. v. d. Meulen. “Kinetics of Thermal Growth of Ultra-Thin Layers of SiO2 on Silicon: Part II. Theory”. In:journal of the electrochemical society119.8 (1972), pp. 1100–1106
work page 1972
-
[42]
Thermal oxidation of silicon: In situ measurement of the growth rate using ellipsometry
M. Hopper, R. Clarke, and L. Young. “Thermal oxidation of silicon: In situ measurement of the growth rate using ellipsometry”. In:Journal of the electrochemical society122.9 (1975), pp. 1216–1222
work page 1975
-
[43]
Thermal oxidation of silicon in dry oxygen: accurate determination of the kinetic rate constants
H. Z. Massoud, J. D. Plummer, and E. A. Irene. “Thermal oxidation of silicon in dry oxygen: accurate determination of the kinetic rate constants”. In:Journal of the Electrochemical Society132.7 (1985), pp. 1745–1753
work page 1985
-
[44]
L. Cvitkovich, D. Waldh¨ or, A.-M. El-Sayed, M. Jech, C. Wilhelmer, and T. Grasser. “Dynamic modeling of Si (100) thermal oxidation: Oxidation mechanisms and realistic amorphous interface generation”. In:Applied Surface Science610 (2023), p. 155378. 33
work page 2023
-
[45]
Reactions and diffusion of water and oxygen molecules in amorphous SiO 2
T. Bakos, S. Rashkeev, and S. Pantelides. “Reactions and diffusion of water and oxygen molecules in amorphous SiO 2”. In:Physical Review Letters88.5 (2002), p. 055508
work page 2002
-
[46]
An 18O study of the thermal oxidation of silicon in oxygen
E. Rosencher, A. Straboni, S. Rigo, and G. Amsel. “An 18O study of the thermal oxidation of silicon in oxygen”. In:Applied Physics Letters34.4 (1979), pp. 254–256
work page 1979
-
[47]
An 18O study of the oxidation mechanism of silicon in dry oxygen
F. Rochet, B. Agius, and S. Rigo. “An 18O study of the oxidation mechanism of silicon in dry oxygen”. In:Journal of The Electrochemical Society131.4 (1984), pp. 914–923
work page 1984
-
[48]
Oxygen mobility in silicon dioxide and silicate glasses: a review
M. Lamkin, F. Riley, and R. Fordham. “Oxygen mobility in silicon dioxide and silicate glasses: a review”. In:Journal of the European Ceramic Society10.5 (1992), pp. 347–367
work page 1992
-
[49]
Multiscale modeling of oxygen diffusion through the oxide during silicon oxidation
A. Bongiorno and A. Pasquarello. “Multiscale modeling of oxygen diffusion through the oxide during silicon oxidation”. In:Physical Review B—Condensed Matter and Materials Physics 70.19 (2004), p. 195312
work page 2004
-
[50]
Discovering catalytic reaction networks using deep reinforcement learning from first-principles
T. Lan and Q. An. “Discovering catalytic reaction networks using deep reinforcement learning from first-principles”. In:Journal of the American Chemical Society143.40 (2021), pp. 16804– 16812
work page 2021
-
[51]
Molecular Autonomous Pathfinder Using Deep Reinforcement Learning
K.-i. Nomura, A. Mishra, T. Sang, R. K. Kalia, A. Nakano, and P. Vashishta. “Molecular Autonomous Pathfinder Using Deep Reinforcement Learning”. In:The Journal of Physical Chemistry Letters15.19 (2024), pp. 5288–5294
work page 2024
-
[52]
T. Lan, H. Wang, and Q. An. “Enabling high throughput deep reinforcement learning with first principles to investigate catalytic reaction mechanisms”. In:Nature Communications15.1 (2024), p. 6281
work page 2024
-
[53]
Reinforcement Learning-Guided Long-Timescale Simulation of Hydrogen Transport in Metals
H. Tang, B. Li, Y. Song, M. Liu, H. Xu, G. Wang, H. Chung, and J. Li. “Reinforcement Learning-Guided Long-Timescale Simulation of Hydrogen Transport in Metals”. In:Advanced Science11.5 (2024), p. 2304122
work page 2024
-
[54]
V. Bihani, S. Manchanda, S. Sastry, S. Ranu, and N. A. Krishnan. “Stridernet: A graph reinforcement learning approach to optimize atomic structures on rough energy landscapes”. In:International Conference on Machine Learning. PMLR. 2023, pp. 2431–2451
work page 2023
-
[55]
Learning with delayed rewards—a case study on inverse defect design in 2D materials
S. Banik, T. D. Loeffler, R. Batra, H. Singh, M. J. Cherukara, and S. K. Sankaranarayanan. “Learning with delayed rewards—a case study on inverse defect design in 2D materials”. In: ACS applied materials & interfaces13.30 (2021), pp. 36455–36464
work page 2021
-
[56]
A Continuous Action Space Tree search for INverse desiGn (CASTING) framework for materials discovery
S. Banik, T. Loefller, S. Manna, H. Chan, S. Srinivasan, P. Darancet, A. Hexemer, and S. K. Sankaranarayanan. “A Continuous Action Space Tree search for INverse desiGn (CASTING) framework for materials discovery”. In:npj Computational Materials9.1 (2023), p. 177
work page 2023
-
[57]
J. Yoon, Z. Cao, R. K. Raju, Y. Wang, R. Burnley, A. J. Gellman, A. Barati Farimani, and Z. W. Ulissi. “Deep reinforcement learning for predicting kinetic pathways to surface reconstruction in a ternary alloy”. In:Machine Learning: Science and Technology2.4 (2021), p. 045018
work page 2021
-
[58]
Y. Park, J. Kim, S. Hwang, and S. Han. “Scalable parallel algorithm for graph neural network interatomic potentials in molecular dynamics simulations”. In:Journal of chemical theory and computation20.11 (2024), pp. 4857–4868
work page 2024
-
[59]
Proximal Policy Optimization Algorithms
J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov. “Proximal policy optimization algorithms”. In:arXiv preprint arXiv:1707.06347(2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[60]
Symphony: Symmetry-equivariant point- centered spherical harmonics for 3d molecule generation
A. Daigavane, S. Kim, M. Geiger, and T. Smidt. “Symphony: Symmetry-equivariant point- centered spherical harmonics for 3d molecule generation”. In:arXiv preprint arXiv:2311.16199 (2023). 34
-
[61]
A. P. Thompson, H. M. Aktulga, R. Berger, D. S. Bolintineanu, W. M. Brown, P. S. Crozier, P. J. In’t Veld, A. Kohlmeyer, S. G. Moore, T. D. Nguyen, et al. “LAMMPS-a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales”. In:Computer physics communications271 (2022), p. 108171
work page 2022
-
[62]
Development of the reactive force field and silicon dry/wet oxidation process modeling
J. Noaki, S. Numazawa, J. Jeon, and S. Kochi. “Development of the reactive force field and silicon dry/wet oxidation process modeling”. In:npj Computational Materials9.1 (2023), p. 161
work page 2023
-
[63]
Machine learning force field for thermal oxidation of silicon
L. Cvitkovich, F. Fehringer, C. Wilhelmer, D. Milardovich, D. Waldh¨ or, and T. Grasser. “Machine learning force field for thermal oxidation of silicon”. In:The Journal of Chemical Physics161.14 (2024)
work page 2024
-
[64]
Atom-centered symmetry functions for constructing high-dimensional neural network potentials
J. Behler. “Atom-centered symmetry functions for constructing high-dimensional neural network potentials”. In:The Journal of chemical physics134.7 (2011)
work page 2011
-
[65]
Riemann manifold langevin and hamiltonian monte carlo methods
M. Girolami and B. Calderhead. “Riemann manifold langevin and hamiltonian monte carlo methods”. In:Journal of the Royal Statistical Society Series B: Statistical Methodology73.2 (2011), pp. 123–214
work page 2011
-
[66]
C. Durkan, A. Bekasov, I. Murray, and G. Papamakarios. “Neural spline flows”. In:Advances in neural information processing systems32 (2019)
work page 2019
-
[67]
R. A. Fisher. “Dispersion on a sphere”. In:Proceedings of the royal society of London. Series A. Mathematical and physical sciences217.1130 (1953), pp. 295–305
work page 1953
-
[68]
High-Dimensional Continuous Control Using Generalized Advantage Estimation
J. Schulman, P. Moritz, S. Levine, M. Jordan, and P. Abbeel. “High-dimensional continuous control using generalized advantage estimation”. In:arXiv preprint arXiv:1506.02438(2015)
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[69]
W. Futako, N. Mizuochi, and S. Yamasaki. “In situ ESR observation of interface dangling bond formation processes during ultrathin SiO 2 growth on Si (111)”. In:Physical Review Letters92.10 (2004), p. 105505
work page 2004
-
[70]
Inherent Si dangling bond defects at the thermal (110) Si/SiO 2 interface
K. Keunen, A. Stesmans, and V. Afanas’ ev. “Inherent Si dangling bond defects at the thermal (110) Si/SiO 2 interface”. In:Physical Review B—Condensed Matter and Materials Physics 84.8 (2011), p. 085329
work page 2011
-
[71]
K. P. Huber and G. Herzberg.Constants of Diatomic Molecules. Ed. by P. J. Linstrom and W. G. Mallard. NIST Chemistry WebBook, NIST Standard Reference Database Number 69, National Institute of Standards and Technology, Gaithersburg MD, 20899. Retrieved May 6, 2026.url:https://doi.org/10.18434/T4D303
-
[72]
Vibrational thermodynamics of materials
B. Fultz. “Vibrational thermodynamics of materials”. In:Progress in Materials Science55.4 (2010), pp. 247–352. 35
work page 2010
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.