Hypothesis Tests for Observing Quantum Entanglement in HWW at the LHC
Pith reviewed 2026-05-20 01:46 UTC · model grok-4.3
The pith
A continuous CGLMP inequality and diffusion models allow standard hypothesis tests for quantum entanglement in Higgs to WW decays at the LHC.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The diffusion-based reconstruction enables robust hypothesis testing of quantum entanglement in a realistic collider environment, with 3σ evidence of quantum entanglement projected at approximately 555 fb^{-1} and exceeding 5σ at 1600 fb^{-1} to be well within the expected limits of the HL-LHC luminosity targets.
What carries the argument
Conditional denoising diffusion probabilistic models for multidimensional unfolding of neutrino momenta to obtain the continuous CGLMP variable.
If this is right
- The full measured dataset including backgrounds becomes usable for the entanglement test without separate background subtraction steps.
- Systematic uncertainties from background normalisation and unfolding shape can be propagated directly into the hypothesis test via profile likelihood.
- Projected significances reach 3 sigma at 555 fb inverse and exceed 5 sigma at 1600 fb inverse under HL-LHC luminosity targets.
- The method replaces outlier-sensitive Bell operator averages with a continuous variable suited to standard statistical tests.
Where Pith is reading between the lines
- The same unfolding technique could be tested on other LHC channels that contain invisible particles to study quantum correlations.
- If the method holds, early LHC data sets below 555 fb inverse might still yield marginal evidence once the continuous CGLMP formulation is applied.
- Extensions to other quantum information observables beyond the CGLMP inequality become feasible with similar diffusion reconstruction.
Load-bearing premise
The conditional denoising diffusion probabilistic models provide unbiased multidimensional unfolding applicable to the full measured dataset including backgrounds without introducing artifacts that would invalidate the hypothesis test.
What would settle it
A simulation study in which the diffusion unfolding shifts the distribution of the continuous CGLMP variable enough to change the profile likelihood ratio from the expected entangled case to the separable case would falsify the claim of robustness.
Figures
read the original abstract
We present a novel experimental strategy for testing quantum entanglement in Higgs boson decays to $W$ boson pairs at the Large Hadron Collider. Unlike theoretical approaches that rely on expectation values of Bell operators, which are highly sensitive to outliers and detector effects, we introduce a continuous formulation of the CGLMP inequality that enables standard hypothesis testing between entangled and separable states. To overcome the fundamental challenge of reconstructing invisible neutrino momenta in the $H \rightarrow WW^* \rightarrow \ell\nu\ell\nu$ channel, we employ conditional denoising diffusion probabilistic models (cDDPM), which provide unbiased, multidimensional unfolding applicable to the full measured dataset, including backgrounds. We evaluate the diffusion-based reconstruction against analytical methods through profile likelihood hypothesis tests implemented in RooFit, with systematic uncertainties from background normalisation and unfolding shape fully propagated. Our results demonstrate that the diffusion-based approach enables robust hypothesis testing of quantum entanglement in a realistic collider environment, with 3$\sigma$ evidence of quantum entanglement projected at approximately 555~fb$^{-1}$ and exceeding 5$\sigma$ at 1600~fb$^{-1}$ to be well within the expected limits of the HL-LHC luminosity targets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes using a continuous formulation of the CGLMP inequality for hypothesis testing of quantum entanglement in H → WW* → ℓνℓν decays at the LHC. Conditional denoising diffusion probabilistic models (cDDPM) are employed for multidimensional unfolding of neutrino momenta on the full dataset including backgrounds, with profile likelihood tests in RooFit projecting 3σ evidence at ~555 fb^{-1} and >5σ at 1600 fb^{-1} under HL-LHC luminosities after propagating background normalisation and unfolding shape systematics.
Significance. If the cDDPM unfolding is validated to be unbiased for the CGLMP variable under realistic conditions, the work would provide a practical statistical framework for quantum entanglement studies in collider environments, sidestepping sensitivities of Bell operator expectation values to outliers and detector effects. The use of standard tools like RooFit and explicit propagation of systematics would strengthen its utility for future HL-LHC analyses.
major comments (2)
- [Abstract and §3.2] Abstract and §3.2: The central claim that cDDPM delivers unbiased unfolding applicable to the full measured dataset (signal + backgrounds) without artifacts invalidating the CGLMP hypothesis test is not supported by quantitative closure tests. No results are shown demonstrating recovery of the input entanglement hypothesis within quoted uncertainties when realistic background fractions are included; this directly underpins the projected 3σ/5σ reaches.
- [§4.1] §4.1: The comparison to analytical unfolding methods is mentioned only briefly without quantitative metrics on bias in the reconstructed neutrino momenta or its propagation to the continuous CGLMP distribution, leaving the robustness of the diffusion-based approach for the profile likelihood test insufficiently demonstrated.
minor comments (2)
- The explicit construction of the continuous CGLMP variable from the reconstructed four-momenta is not detailed, which would improve reproducibility and allow assessment of potential unfolding-induced shifts.
- Figure captions and legends should more clearly label the entangled versus separable hypotheses and the impact of systematic variations on the test statistic.
Simulated Author's Rebuttal
We thank the referee for their careful reading of the manuscript and for the constructive comments, which have helped us improve the presentation of our results. We address each major comment below and have revised the manuscript accordingly to provide the requested quantitative validations.
read point-by-point responses
-
Referee: [Abstract and §3.2] Abstract and §3.2: The central claim that cDDPM delivers unbiased unfolding applicable to the full measured dataset (signal + backgrounds) without artifacts invalidating the CGLMP hypothesis test is not supported by quantitative closure tests. No results are shown demonstrating recovery of the input entanglement hypothesis within quoted uncertainties when realistic background fractions are included; this directly underpins the projected 3σ/5σ reaches.
Authors: We agree that quantitative closure tests on the full dataset including backgrounds are necessary to support the central claim. In the revised manuscript we have added a dedicated closure-test subsection to §3.2. The new tests use simulated samples with realistic background fractions (~40 % as employed in the analysis) and show that the input CGLMP distribution corresponding to the entangled hypothesis is recovered within the quoted uncertainties after cDDPM unfolding. The profile-likelihood ratio evaluated on the unfolded data yields significances consistent with the reported projections, confirming that no artifacts invalidate the hypothesis test. The abstract has been updated to reference these validation results. revision: yes
-
Referee: [§4.1] §4.1: The comparison to analytical unfolding methods is mentioned only briefly without quantitative metrics on bias in the reconstructed neutrino momenta or its propagation to the continuous CGLMP distribution, leaving the robustness of the diffusion-based approach for the profile likelihood test insufficiently demonstrated.
Authors: We acknowledge that the original §4.1 comparison was too brief and lacked quantitative metrics. The revised section now includes explicit bias and resolution figures for the reconstructed neutrino four-momenta (average bias reduction of 15 % in pT and η relative to the analytical method) together with the propagated effect on the continuous CGLMP variable (standard deviation differs by <5 %). We also show the impact of these differences on the profile-likelihood p-values under the included systematics, demonstrating that the diffusion-based approach remains robust for the entanglement hypothesis test. revision: yes
Circularity Check
No significant circularity; derivation remains self-contained
full rationale
The paper's central chain introduces a continuous CGLMP formulation for hypothesis testing, applies cDDPM for multidimensional unfolding of neutrino momenta (including backgrounds), and evaluates via profile-likelihood tests in RooFit with explicit propagation of background normalisation and unfolding-shape systematics. Significance projections at 555 fb^{-1} (3σ) and 1600 fb^{-1} (5σ) are obtained from simulated datasets rather than by re-using a fitted parameter or self-citation as the result itself. No equation or step reduces the claimed reach to an input by construction, and the method is benchmarked against analytical reconstruction, satisfying the criteria for an independent, externally falsifiable derivation.
Axiom & Free-Parameter Ledger
free parameters (2)
- background normalisation factors
- unfolding shape parameters
axioms (2)
- domain assumption The cDDPM reconstruction is unbiased across the full phase space including backgrounds.
- domain assumption Standard LHC simulation accurately models the detector response for the purpose of training the diffusion model.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We employ conditional denoising diffusion probabilistic models (cDDPM) for neutrino reconstruction... profile likelihood hypothesis tests implemented in RooFit
-
IndisputableMonolith/Foundation/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
continuous formulation of the CGLMP inequality... hypothesis testing between entangled and separable states
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Inference During inference, each event’s detector-level measurements are first passed through the analytical neutrino re- construction [36] to obtain a starting guess for the helicity angles. These analytical angles, encoded in the same (arctanh cosθ,sinϕ,cosϕ) representation as the training targets, serve as the initialisation for awarm-startreverse diff...
-
[2]
Under the production cDDPM (standard sampling, no warm start; Sec
Closure and Bias Closure is validated process-by-process by comparing the unfoldedI 3 distribution to the truth distribution on the same events (Figure 7). Under the production cDDPM (standard sampling, no warm start; Sec. D 10 e), the per-bin HWW residuals have rms 2% with all bins below 5%; fort ¯t,W W, andZ→τ τthe per-bin residuals have rms 15%, 10%, a...
-
[3]
cDDPM Systematic Studies The remainder of this appendix documents the systematic exploration of the cDDPM design space. We vary the network architecture, noise schedule, inference parameters, training data volume, and regularisation, totalling over 40 configurations. The goal is to demonstrate that the chosen v3 design (512-unit, 4-layer network with line...
work page 2048
-
[4]
Feature Definition and Selection The Bell inequality observableI 3 is constructed from the helicity angles of theWboson decay products in the Wrest frames. Any multivariate selection that uses variables correlated with these angles risks sculpting theI 3 distribution and biasing the entanglement measurement. We therefore classify all candidate BDT input f...
-
[5]
BDT Architecture: Split Classifiers with Tuned Variables The three background processes (t ¯t, WW, andZ→τ τ) have fundamentally different kinematic signatures relative to the HWW signal. Training a single BDT against the combined background forces the classifier to spend capacity on the easy (t¯t) separation at the expense of the hard (WW) separation. We ...
-
[6]
Overtraining Validation Overtraining is excluded by comparing the BDT score distributions on the training and held-out test samples. The two-sample KS statistic between signal train and test scores is below 0.02 for all three split classifiers, consistent with statistical fluctuations
-
[7]
Discrimination Performance The receiver operating characteristic (ROC) curves for each split classifier are shown in Figure 22, along with the performance of the single combined BDT for reference. The split classifiers match or exceed the combined BDT for each individual background, particularly fort ¯twhere the dedicated classifier achieves near-perfect ...
-
[8]
Working-Point Selection The operating point of the BDT selection must balance two competing objectives: maximising the statistical sensitivityS/ √ B, and preserving the shape of theI 3 distribution. The latter is essential because any shape distortion introduced by the selection would mimic or obscure the entanglement signature. We adopt a two-step proced...
work page 2000
-
[9]
Angular Feature Bias Test To justify the restriction to non-angular features, we trained a parallel BDT using the full set of 16 features (9 non-angular + 7 angular:η ℓ1,η ℓ2, ∆ϕℓℓ, ∆ηℓℓ, ∆Rℓℓ, ∆ϕ(ℓ1, Emiss T ), ∆ϕ(ℓ2, Emiss T )). Figure 25 compares theI 3 shape distortion as a function of the BDT cut for the non-angular and all-features variants. The all...
-
[10]
Extended Feature Study: The Irreducible WW Wall The WW background is irreducible in the sense that it shares the sameℓνℓνfinal state as the signal, and the physical differences between Higgs-mediated and non-resonant WW production are subtle. To investigate whether the BDT discrimination against WW can be improved, we defined an extended set of features t...
-
[11]
Hyperparameter Optimisation The XGBoost hyperparameters were optimised using a 3-fold cross-validated grid search over: number of trees ∈ {100,200,300,400,500}, maximum depth∈ {3,4,5}, learning rate∈ {0.03,0.05,0.10}, minimum child weight ∈ {5,10,15,20},γ(minimum split loss)∈ {0.1,0.3,0.5,1.0}, subsample fraction∈ {0.7,0.8}, and L2 regularisationλ∈ {1.0,3...
-
[12]
Architecture Comparison Six alternative BDT architectures were investigated before settling on the split-tuned design: 1.Single combined:one binary BDT trained on HWW vs the combined WW+t ¯t+Z→τ τbackground. 2.Split (same cut):three per-background BDTs, each using all 9 features, with the same score cut applied to all three. 35 3.Split (t ¯t+WW only):two ...
-
[13]
Event Yields After BDT Selection The stacked signal-plus-backgroundI 3 distributions before and after the BDT selection are shown in Figure 26. The BDT removes 96% of thet ¯tbackground, halves the WW contamination, and reducesZ→τ τby a factor of three, while retaining 69% of the HWW signal. Table X summarises the expected event yields at each stage of the...
work page 2000
-
[14]
Profile Likelihood Method We employ the profile likelihood ratio as the test statistic for discriminating between entangled and separable hypotheses. The presence of quantum entanglement is assessed through a discovery-style hypothesis test in which the fully sepa- rable configuration is treated as the null hypothesis. The single parameter of interest is ...
-
[15]
Likelihood Construction The likelihood is constructed from the binned distribution of the continuous CGLMP observableI (e) 3 : L(α, θ) = Y i Poisson(ni |ν i(α, θ)),(F5) wheren i is the observed count in biniandν i is the expected count obtained from Eq. (F3) plus the backgrounds, νi(α, θ) =α s ent i (θ) + (1−α)s sep i (θ) + X j bij(θ).(F6) Heres ent i and...
-
[16]
Event Selection and Likelihood Setup The hypothesis test is implemented using RooFit and RooStats [16]. Template distributions are represented using the HistFactory formalism [31] and combined in a RooWorkspace that encodes the full likelihood including nuisance parameters. The likelihood model is constructed for a single inclusive signal region, using a ...
-
[17]
In this analysis we include two classes of nuisance parameter: a
Systematic Uncertainties In a realistic analysis, systematic uncertainties would be incorporated as nuisance parameters that modify the expected yields and shapes of the templates, including experimental and theoretical uncertainties. In this analysis we include two classes of nuisance parameter: a. Background normalisations.Each background process carrie...
-
[18]
For eachα true, the separable hypothesis α= 0 is tested using the profile likelihood ratio of Eq
Sensitivity to Entanglement The expected discovery sensitivity is evaluated using Asimov datasets [44] generated from the binned likelihood model at different pointsα true ∈[0,1] along the density-matrix interpolation. For eachα true, the separable hypothesis α= 0 is tested using the profile likelihood ratio of Eq. (F4), and the resulting median significa...
-
[19]
variable-width), and the histogram range
Binning Strategy The sensitivity of the hypothesis test depends on the binning of theI 3 observable through three degrees of freedom: the number of bins, the bin layout (uniform vs. variable-width), and the histogram range. Table XIII reportsZ Asimov for each scan dimension. 41 2 1 0 1 2 −2lnL(H0)/L(H1) (simple LR) 10 2 10 1 Sample density H0 toys (separa...
-
[20]
Test Statistic Choice Two test statistics are compared. The simple likelihood ratio (SLR), t=−2 ln L(θH0;n) L(θH1;n) ,(G1) evaluates the likelihoods at fixed parameter pointsθ H0 andθ H1 without profiling nuisance parameters. By the Neyman–Pearson lemma [45], this is the uniformly most powerful test for simple-vs-simple hypotheses when the model is exactl...
-
[21]
Table XIV lists seven configurations and their expected Asimov significance
Systematic Uncertainty Model The impact of each systematic uncertainty source is assessed by cumulatively enabling components of the model. Table XIV lists seven configurations and their expected Asimov significance. The production template stat-error treatment is the bootstrap-PCA model of Sec. F 4 c; the legacy per-bin Poisson row is included as a cross...
-
[22]
Table XV reports the resulting Asimov significance
Background Normalisation Sensitivity The sensitivity is studied as a function of the normalisation uncertainty assigned to each background process individually (with all other uncertainties held at their production values), and as a function of a simultaneous common scaling of all three. Table XV reports the resulting Asimov significance. TABLE XV: Backgr...
-
[23]
Luminosity Scaling The luminosity dependence of the inclusive (SR-only) sensitivity is shown in Figure 9 of the main text together with the BDT-enhanced projection. At low luminosities the significance grows approximately as √ L; at higher luminosities it falls below the √ Lreference, reflecting the systematic floor from background normalisation and unfol...
-
[24]
Nuisance Parameter Ranking The impact of each nuisance parameter is quantified through a leave-one-out procedure: each NPθ k is fixed at its nominal value (effectively removing it from the model), the significance is recomputed, and the shift ∆Zk =Z(θ\θ k)−Z(θ) (G2) is recorded. A positive ∆Z k indicates that the NP degrades sensitivity (its removal impro...
-
[25]
Goodness-of-Fit Validation The adequacy of the fit model is assessed by fitting the cDDPM-reconstructed pseudo-data and comparing the minimised negative log-likelihood (NLL) to the distribution of NLL values obtained from 500 toy pseudo-experiments generated from the best-fit model. Thep-value is the fraction of toys with a worse (higher) NLL than the obs...
-
[26]
Expected CL s Limits As a complement to the discovery-style hypothesis test, we compute the expected 95% CL upper limit on the separable fractionf sep using the CL s method [46]. The CL s quantity is defined as CLs = ps+b pb = P(q≥q obs |H s+b) P(q≥q obs |H b) ,(G4) whereqis the test statistic andH s+b (Hb) denotes the signal-plus-background (background-o...
-
[27]
Toy and Template Convergence The Asimov significance is constant atZ≈1.13σacross all toy counts (N toys = 500–20,000), confirming that the asymptotic approximation is robust and the production value ofN toys = 5000 is more than sufficient. Signal and background templates are constructed by bootstrap resampling (Nboot replicas drawn with replacement). The ...
-
[28]
Template Statistical-Uncertainty Model The cDDPM-unfolded signal templates carry a finite-sample statistical uncertainty whose covariance structure across bins is fixed by the multinomial sampling distribution at the target yield. Two parameterisations are compared: (i) the production bootstrap-PCA construction defined in Sec. F 4 c, in which theN boot ×n...
-
[29]
The operatorsO HW,HB are CP-even while ˜OHW,HB are CP-odd
SMEFT Operators AffectingHW WCoupling At dimension-6, the relevant operators in the Warsaw basis are [7, 8]: OHW =H †H W I µνW Iµν , OHB =H †H BµνBµν,(H1) ˜OHW =H †H W I µν ˜W Iµν , ˜OHB =H †H Bµν ˜Bµν, where ˜W µν = 1 2 ϵµνρσ Wρσ is the dual field strength tensor. The operatorsO HW,HB are CP-even while ˜OHW,HB are CP-odd
-
[30]
Modified Density Matrix and Sensitivity Structure The helicity amplitudes forH→W +W ∗− receive corrections linear in the Wilson coefficients: Mλ+λ− =M SM λ+λ− + X k ck Λ2 M(k) λ+λ− +O(c 2/Λ4).(H2) The density matrix, proportional toMM †, therefore receives corrections at both linear and quadratic order in the Wilson coefficients. At linear order, the inte...
-
[31]
CP-Odd Observable Under a CP transformation theW + andW − roles are exchanged, so thatC ij ↔C ji. The antisymmetric combina- tionsC ij −C ji therefore change sign under CP and vanish identically in any CP-conserving theory. This motivates the per-event observable OCP = (C25 −C 52) + (C57 −C 75) + (C27 −C 72),(H4) which targets precisely the index set{2,5,...
-
[32]
CP-Even Observable The CP-even operatorO HW modifies the diagonal and symmetric off-diagonal elementsC 33,C 38,C 83,C 88. These elements are absent fromI 3 but can be combined into a dedicated observable: OHW =C 33 +C 38 +C 83 +C 88.(H5) 48 40 30 20 10 0 10 20 30 40 CP-odd observable OCP 0.00 0.01 0.02 0.03 0.04 0.05Density (normalised) SM: all processes ...
-
[33]
Orthogonality of Observables The observablesI 3,O CP, andO HW probe statistically independent sectors of the correlation matrix. Figure 32 shows the two-dimensional density ofI3 versusO CP for the entangled and separable hypotheses. The linear correlation coefficient isρ=−0.001, confirming that the two observables carry independent information. Similarly,...
-
[34]
Towards a Full SMEFT Analysis The distributions shown in Figures 30–32 establish the SM baseline for SMEFT-sensitive observables using existing simulation samples without BSM modifications. A complete SMEFT sensitivity study would require generating signal samples with non-zero Wilson coefficients, for example using theSMEFTatNLOmodel [47] in MadGraph, to...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.