Analytical Nuclear Gradients for State-Averaged Configuration Interaction Singles Variants: Application to Conical Intersections
Pith reviewed 2026-05-15 21:46 UTC · model grok-4.3
The pith
Analytical nuclear gradients for state-averaged CIS variants enable efficient minimum-energy conical intersection searches at mean-field cost.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
State-averaged orbital-optimized CIS captures the essential degeneracy at conical intersections through variational orbital relaxation, which alleviates ground-state Hartree-Fock orbital bias and effectively incorporates static correlation through localization effects. The Lagrangian formulation of the analytical nuclear gradients, with explicit removal of null-space contributions in the coupled perturbed equations, ensures numerically stable gradients that enable black-box, qualitatively reliable descriptions of these intersections at mean-field computational cost.
What carries the argument
The Lagrangian approach to analytical nuclear gradients for SACIS and SAECIS, which removes null-space contributions in the coupled perturbed equations to ensure numerical stability.
Load-bearing premise
The state-averaged orbital optimization and Lagrangian gradient formulation remain numerically stable and qualitatively correct for conical intersections even when higher double-excitation character is present, without post-hoc adjustments.
What would settle it
A benchmark calculation on one of the twelve MECXs or a new system with strong double-excitation character where SACIS or SAECIS yields an RMSD exceeding 0.1 Å or fails to locate a degenerate point.
read the original abstract
We derive analytical nuclear gradients for state-averaged orbital-optimized configuration interaction singles (SACIS) and its spin-projected extension (SAECIS), enabling efficient geometry optimization and minimum-energy conical intersection (MECX) searches within a low-cost CIS-based framework. The formulation employs a Lagrangian approach and explicitly removes null-space contributions in the coupled perturbed equations to ensure numerically stable gradients. For twisted-pyramidalized ethylene, both SACIS and SAECIS qualitatively reproduce the correct conical intersection topology, in sharp contrast to conventional CIS and ECIS. Benchmark calculations on twelve MECXs demonstrate that both methods reproduce geometries with mean RMSDs below 0.1~{\AA} relative to high-level reference methods. SACIS captures the essential degeneracy through variational orbital relaxation, which alleviates ground-state Hartree--Fock (HF) orbital bias and effectively incorporates static correlation through localization effects; notably, spin projection is found to be non-essential for the qualitative description of these intersections. Overall, SACIS and SAECIS provide qualitatively reliable CX descriptions at mean-field computational cost in a black-box manner. Given their comparable accuracy and the additional overhead associated with spin projection, SACIS offers a more favorable cost-performance balance for general applications, whereas SAECIS may become advantageous when higher excited states with significant double-excitation character are involved.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript derives analytical nuclear gradients for state-averaged orbital-optimized configuration interaction singles (SACIS) and its spin-projected extension (SAECIS) using a Lagrangian approach that explicitly removes null-space contributions from the coupled-perturbed equations for numerical stability. These gradients are applied to geometry optimizations and minimum-energy conical intersection (MECX) searches. Benchmarks on twelve MECXs show mean RMSDs below 0.1 Å relative to high-level references, with SACIS reproducing correct topologies for cases like twisted-pyramidalized ethylene through variational orbital relaxation.
Significance. If validated, this provides a low-cost, black-box method for exploring conical intersections in systems where multireference methods are too expensive. The explicit handling of orbital relaxation and degeneracy in the gradient formulation is a notable technical contribution, supported by the reported benchmark accuracies.
major comments (1)
- The Lagrangian formulation removes null-space contributions to ensure stability. At conical intersections, the two states are degenerate, so the null space includes relative orbital rotations between them. Projecting these out may affect components in the branching plane. The ethylene example reproduces correct topology, but no direct comparison of gradients or geometries with versus without the projection is provided, which is needed to confirm the approximation does not alter the effective nuclear gradients.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of our work and the constructive comment on the stability of the Lagrangian formulation. We address the concern below and will revise the manuscript accordingly.
read point-by-point responses
-
Referee: The Lagrangian formulation removes null-space contributions to ensure stability. At conical intersections, the two states are degenerate, so the null space includes relative orbital rotations between them. Projecting these out may affect components in the branching plane. The ethylene example reproduces correct topology, but no direct comparison of gradients or geometries with versus without the projection is provided, which is needed to confirm the approximation does not alter the effective nuclear gradients.
Authors: We appreciate the referee's careful reading and the valid point raised about the null-space projection at degeneracies. In the Lagrangian approach, the null-space components correspond to relative orbital rotations between the degenerate states; these directions have zero eigenvalues in the coupled-perturbed equations and therefore do not contribute to the physical nuclear gradients or to the branching-plane vectors. Removing them is strictly a numerical stabilization step that leaves the effective gradient unchanged. The fact that both SACIS and SAECIS recover the correct conical-intersection topology for twisted-pyramidalized ethylene (in contrast to standard CIS) and yield mean RMSDs below 0.1 Å across the twelve-molecule benchmark set already indicates that the projection preserves the essential physics. Nevertheless, we acknowledge that an explicit side-by-side comparison was not provided in the original manuscript. In the revised version we will add a short supplementary analysis for the ethylene MECX that reports the gradient norms and optimized geometries obtained with and without the projection; the differences are expected to be negligible (<0.01 Å RMSD) and the branching-plane vectors identical within numerical precision. This addition will directly address the referee's request. revision: yes
Circularity Check
No circularity: Lagrangian gradient derivation is independent of fitted data or self-citations
full rationale
The paper presents an explicit derivation of analytical nuclear gradients for SACIS/SAECIS using a Lagrangian formulation with null-space removal in the coupled-perturbed equations. No step reduces a claimed prediction or result to a fitted parameter, self-defined quantity, or load-bearing self-citation whose validity depends on the present work. Benchmarks compare to external high-level references rather than internal fits, and the topology reproduction for ethylene is shown via direct computation rather than by construction. The derivation chain is self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Born-Oppenheimer approximation for separating nuclear and electronic degrees of freedom
- domain assumption Validity of state-averaged orbital optimization for capturing static correlation at conical intersections
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquationwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The formulation employs a Lagrangian approach and explicitly removes null-space contributions in the coupled perturbed equations to ensure numerically stable gradients.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 1 Pith paper
-
TD$\Delta$SCF: Time-Dependent Density Functional Theory with a Non-Aufbau Reference for near-degenerate states
TDΔSCF performs TDDFT linear response on a non-Aufbau ΔSCF reference to describe near-degenerate singlet states with less functional sensitivity and fewer spurious states than collinear SF-TDDFT.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.