pith. sign in

arxiv: 2601.02294 · v5 · submitted 2026-01-05 · ⚛️ physics.chem-ph

Coupling between thermochemical contributions of subvalence correlation and of higher-order post-CCSD(T) correlation effects -- a step toward `W5 theory'

Pith reviewed 2026-05-16 17:44 UTC · model grok-4.3

classification ⚛️ physics.chem-ph
keywords total atomization energypost-CCSD(T)subvalence correlationW5 theoryW4-08 benchmarkcore-valence effectsstatic correlationactive thermochemical tables
0
0 comments X

The pith

Post-CCSD(T) and subvalence correlation couple to shift total atomization energies, especially for second-row molecules.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines how higher-order electron correlation effects beyond CCSD(T) interact with subvalence (core-valence) correlation when computing total atomization energies for first- and second-row molecules. This interaction produces sizable corrections to bond energies when multiple second-row atoms sit next to one another or when static correlation is strong. Reoptimizing molecular geometries to include core-valence effects raises the calculated atomization energies across the set, with the largest gains for those second-row compounds. A first W5 theory protocol is outlined that folds these coupled contributions together, and the resulting TAE0 values for the W4-08 benchmark set show nontrivial revisions for several key species while aligning closely with active thermochemical table references.

Core claim

Post-CCSD(T) contributions, in particular connected quadruples (Q), become large whenever several neighboring second-row atoms are present; both higher-order triples T3–(T) and quadruples matter in systems with strong static correlation. Reoptimization of reference geometries for core-valence correlation increases every calculated TAE, most markedly for second-row compounds. The first W5 theory protocol built from these findings yields revised TAE values on the W4-08 set that agree well with ATcT active thermochemical tables, including the recent network extensions to boron, silicon, and sulfur compounds.

What carries the argument

The coupling between subvalence correlation and higher-order post-CCSD(T) terms in total atomization energy calculations, captured through a proposed W5 protocol.

If this is right

  • TAE0 values for several key second-row species in the W4-08 set require upward revision.
  • Geometry relaxation under core-valence correlation must be included to reach sub-kcal/mol accuracy for second-row compounds.
  • Systems with multiple adjacent second-row atoms show the largest (Q) contributions.
  • Predicted TAE0 values remain consistent with ATcT references after the coupling is accounted for.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same coupling pattern may appear in larger clusters or surfaces containing second-row elements, suggesting W5-style corrections could improve models of materials energetics.
  • If the coupling scales with the number of second-row neighbors, reaction networks involving silicon or sulfur compounds may need systematic re-evaluation.
  • Extending the protocol to reaction barriers rather than only atomization energies would test whether the same corrections apply to kinetics.

Load-bearing premise

The W4-08 benchmark set together with the chosen post-CCSD(T) approximations fully represent the coupling between subvalence and higher-order correlation for every relevant first- and second-row molecule, including those with strong static correlation.

What would settle it

A high-precision experimental or full-configuration-interaction TAE0 measurement for a second-row molecule containing two or more neighboring second-row atoms that deviates from the W5 prediction by more than the combined uncertainty.

read the original abstract

We consider the thermochemical impact of post-CCSD(T) contributions to the total atomization energy (TAE, the sum of all bond energies) of first- and second-row molecules, and specifically their coupling with the subvalence correlation contribution. In particular, we find large contributions from (Q) when there are several neighboring second-row atoms. Otherwise, both higher-order triples $T_3$--(T) and connected quadruples (Q) are important in systems with strong static correlation. Reoptimization of the reference geometry for core-valence correlation increases the calculated TAE across the board, most pronouncedly so for second-row compounds with neighboring second-row atoms. %just slightly increases the calculated TAE for all species, but more pronouncedly so if strong static correlation is present, as well as for second-row compounds, again especially with neighboring second-row atoms. We present a first proposal for a `W5 theory' protocol and compare computed TAEs for the W4-08 benchmark with prior reference values. For some key second-row species, the new values represent nontrivial revisions. Our predicted TAE$_0$ values (TAE at 0 K) agree well with the ATcT (active thermochemical tables) values, including for the very recent expansion of the ATcT network to boron, silicon, and sulfur compounds.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript examines the thermochemical coupling between subvalence (core-valence) correlation and higher-order post-CCSD(T) contributions (T3-(T) and (Q)) to total atomization energies (TAEs) for first- and second-row molecules. It reports large (Q) effects for systems with multiple neighboring second-row atoms, importance of both T3-(T) and (Q) under strong static correlation, and that reoptimizing reference geometries for core-valence correlation increases computed TAEs (most for second-row species). The authors propose an initial W5 theory protocol, compare results on the W4-08 benchmark to prior values (noting nontrivial revisions for key second-row cases), and show good agreement between predicted TAE0 values and ATcT data.

Significance. If the reported couplings and revisions hold under uniform methodology, the work provides a concrete advance toward higher-accuracy composite thermochemistry by quantifying interactions that prior W4-level protocols may have underestimated. Explicit benchmarking against external ATcT tables and identification of geometry-reoptimization effects are strengths that could inform future protocol design for molecules containing second-row elements.

major comments (2)
  1. [W4-08 benchmark and results] W4-08 benchmark set (results and discussion sections): the central claim that subvalence-(Q) coupling is large for neighboring second-row atoms and that T3-(T)/(Q) matter under static correlation rests on this set adequately sampling the relevant regime. The set appears to contain few multi second-row systems with pronounced static correlation; without additional validation cases or explicit enumeration of such species, the proposed W5 protocol risks systematic bias for untested molecules.
  2. [Methods] Methods section (geometry reoptimization and basis-set choices): the abstract and results state that reoptimization for core-valence correlation increases TAE, but it is unclear whether this was applied uniformly across all post-CCSD(T) contributions or only selectively. If the higher-order terms were evaluated at non-reoptimized geometries, the reported coupling magnitudes may be inconsistent.
minor comments (2)
  1. [Abstract] Abstract: the commented-out sentence fragment should be removed or integrated for clarity.
  2. [Abstract and results] Notation: ensure consistent use of TAE0 vs. TAE throughout; the subscript 0 is introduced late in the abstract.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. We address the major comments point by point below. We believe the suggested revisions will strengthen the manuscript.

read point-by-point responses
  1. Referee: [W4-08 benchmark and results] W4-08 benchmark set (results and discussion sections): the central claim that subvalence-(Q) coupling is large for neighboring second-row atoms and that T3-(T)/(Q) matter under static correlation rests on this set adequately sampling the relevant regime. The set appears to contain few multi second-row systems with pronounced static correlation; without additional validation cases or explicit enumeration of such species, the proposed W5 protocol risks systematic bias for untested molecules.

    Authors: We appreciate the referee's concern regarding the sampling in the W4-08 benchmark set. While W4-08 is a standard set for high-accuracy thermochemistry and does include several second-row species with multiple neighboring atoms (such as SO2, CS2, and others with potential static correlation), we acknowledge that the number of systems with pronounced multi-reference character may be limited. To strengthen the presentation, we will add an explicit enumeration of the multi second-row atom systems in W4-08 and discuss their correlation characteristics. The good agreement with ATcT values for the revised TAEs, including recent expansions to B, Si, and S compounds, provides external validation for the protocol. We will also note the need for future benchmarks on additional challenging cases. revision: partial

  2. Referee: [Methods] Methods section (geometry reoptimization and basis-set choices): the abstract and results state that reoptimization for core-valence correlation increases TAE, but it is unclear whether this was applied uniformly across all post-CCSD(T) contributions or only selectively. If the higher-order terms were evaluated at non-reoptimized geometries, the reported coupling magnitudes may be inconsistent.

    Authors: We thank the referee for pointing out this ambiguity. The reoptimization of reference geometries for core-valence correlation was indeed applied uniformly to all contributions, including the post-CCSD(T) terms T3-(T) and (Q). The higher-order correlation calculations were performed at the reoptimized geometries to ensure consistency in the reported couplings. We will revise the Methods section to explicitly clarify this procedure and provide additional details on the basis sets and optimization criteria used. revision: yes

Circularity Check

0 steps flagged

No circularity: explicit ab initio computations benchmarked to external ATcT tables

full rationale

The paper's central results derive from direct CCSD(T) and post-CCSD(T) calculations of total atomization energies (TAEs) on the W4-08 set, with geometry reoptimization and subvalence correlation contributions evaluated explicitly. These are compared to independent ATcT reference values and prior literature benchmarks. No load-bearing step reduces by definition or self-citation to a fitted parameter defined within the paper; the proposed W5 protocol is an empirical synthesis of the computed patterns rather than a tautological renaming or self-referential fit. Self-citations, if present, are not load-bearing for the core claims.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The work rests on standard quantum-chemistry assumptions plus the practical choice of which post-CCSD(T) terms to include; no new entities are postulated.

axioms (2)
  • standard math Born-Oppenheimer approximation and non-relativistic Hamiltonian are sufficient for the target accuracy
    Invoked implicitly throughout all CC calculations for first- and second-row molecules
  • domain assumption The W4-08 set is representative for assessing coupling effects
    Used as the sole benchmark for validating the W5 protocol

pith-pipeline@v0.9.0 · 5578 in / 1335 out tokens · 53340 ms · 2026-05-16T17:44:55.973200+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.