pith. sign in

arxiv: 2605.19754 · v1 · pith:BS7A3GGMnew · submitted 2026-05-19 · ✦ hep-ex

Hypothesis Tests for Observing Quantum Entanglement in HWW at the LHC

Pith reviewed 2026-05-20 01:46 UTC · model grok-4.3

classification ✦ hep-ex
keywords quantum entanglementHiggs bosonLHCdiffusion modelshypothesis testingCGLMP inequalityneutrino reconstructionHL-LHC
0
0 comments X

The pith

A continuous CGLMP inequality and diffusion models allow standard hypothesis tests for quantum entanglement in Higgs to WW decays at the LHC.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops an experimental strategy to test quantum entanglement in Higgs boson decays to W boson pairs using LHC data. It replaces sensitive Bell operator expectation values with a continuous formulation of the CGLMP inequality that supports ordinary hypothesis testing between entangled and separable states. To reconstruct the invisible neutrino momenta in the dilepton channel, it applies conditional denoising diffusion probabilistic models for unbiased multidimensional unfolding on the full dataset including backgrounds. Profile likelihood tests in RooFit propagate systematic uncertainties from background normalisation and unfolding shape. The approach projects 3 sigma evidence of entanglement at roughly 555 inverse femtobarns and more than 5 sigma at 1600 inverse femtobarns, values reachable with HL-LHC integrated luminosity.

Core claim

The diffusion-based reconstruction enables robust hypothesis testing of quantum entanglement in a realistic collider environment, with 3σ evidence of quantum entanglement projected at approximately 555 fb^{-1} and exceeding 5σ at 1600 fb^{-1} to be well within the expected limits of the HL-LHC luminosity targets.

What carries the argument

Conditional denoising diffusion probabilistic models for multidimensional unfolding of neutrino momenta to obtain the continuous CGLMP variable.

If this is right

  • The full measured dataset including backgrounds becomes usable for the entanglement test without separate background subtraction steps.
  • Systematic uncertainties from background normalisation and unfolding shape can be propagated directly into the hypothesis test via profile likelihood.
  • Projected significances reach 3 sigma at 555 fb inverse and exceed 5 sigma at 1600 fb inverse under HL-LHC luminosity targets.
  • The method replaces outlier-sensitive Bell operator averages with a continuous variable suited to standard statistical tests.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same unfolding technique could be tested on other LHC channels that contain invisible particles to study quantum correlations.
  • If the method holds, early LHC data sets below 555 fb inverse might still yield marginal evidence once the continuous CGLMP formulation is applied.
  • Extensions to other quantum information observables beyond the CGLMP inequality become feasible with similar diffusion reconstruction.

Load-bearing premise

The conditional denoising diffusion probabilistic models provide unbiased multidimensional unfolding applicable to the full measured dataset including backgrounds without introducing artifacts that would invalidate the hypothesis test.

What would settle it

A simulation study in which the diffusion unfolding shifts the distribution of the continuous CGLMP variable enough to change the profile likelihood ratio from the expected entangled case to the separable case would falsify the claim of robustness.

Figures

Figures reproduced from arXiv: 2605.19754 by Andre Sopczak, Andrii Vak, Carsten Burgard, Lennart Voelz, Vincent Alexander Croft.

Figure 1
Figure 1. Figure 1: FIG. 1: Visualization of coordinate systems used for [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3: Distribution of the per-event product Φ [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2: Gell-Mann correlation parameters estimated [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4: Continuous CGLMP distribution for various [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Non-resonant WW production dominates the [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6: Top: normalised [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7: Per-process closure of the cDDPM v3 reconstruction. Top: density histograms (truth shaded, unfolded [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: shows the signal-plus-background I3 compo￾sition. The entangled signal sits beneath a factor-of-20 background, underscoring the importance of the profile likelihood approach over simple background subtraction. 10.0 7.5 5.0 2.5 0.0 2.5 5.0 7.5 10.0 3 0 1000 2000 3000 4000 5000 6000 Event counts Signal + background (ATLAS SR) H0: separable + bkg cDDPM data + bkg Z ! ¿¿ (1,110) tt¹ (7,800) WW (8,265) HWW ent.… view at source ↗
Figure 9
Figure 9. Figure 9: FIG. 9: Expected Asimov significance for rejecting the [PITH_FULL_IMAGE:figures/full_fig_p010_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: FIG. 10: Comparison of the theoretical prediction of Ref. [12] (left) against the calculated Gell-Mann coefficients [PITH_FULL_IMAGE:figures/full_fig_p015_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: FIG. 11: Distribution of the [PITH_FULL_IMAGE:figures/full_fig_p016_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: FIG. 12: The [PITH_FULL_IMAGE:figures/full_fig_p017_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: FIG. 13: Per-event scatter of truth vs. analytically-reconstructed [PITH_FULL_IMAGE:figures/full_fig_p017_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: FIG. 14: Discrimination between entangled and separable hypotheses. Left: at truth level, the [PITH_FULL_IMAGE:figures/full_fig_p018_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: FIG. 15: OmniFold signal-only validation on [PITH_FULL_IMAGE:figures/full_fig_p020_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: FIG. 16: OmniFold signal-only closure on four truth-level observables: electron [PITH_FULL_IMAGE:figures/full_fig_p021_16.png] view at source ↗
Figure 17
Figure 17. Figure 17: FIG. 17: OmniFold failure in the mixed-background environment. Left: truth-level electron [PITH_FULL_IMAGE:figures/full_fig_p022_17.png] view at source ↗
Figure 18
Figure 18. Figure 18: FIG. 18: Architecture ablation: HWW [PITH_FULL_IMAGE:figures/full_fig_p025_18.png] view at source ↗
Figure 19
Figure 19. Figure 19: FIG. 19: HWW [PITH_FULL_IMAGE:figures/full_fig_p026_19.png] view at source ↗
Figure 20
Figure 20. Figure 20: FIG. 20 [PITH_FULL_IMAGE:figures/full_fig_p027_20.png] view at source ↗
Figure 21
Figure 21. Figure 21: FIG. 21 [PITH_FULL_IMAGE:figures/full_fig_p028_21.png] view at source ↗
Figure 22
Figure 22. Figure 22: FIG. 22: ROC curves for each split-tuned BDT (solid, coloured) compared to the single combined BDT (dashed, [PITH_FULL_IMAGE:figures/full_fig_p030_22.png] view at source ↗
Figure 23
Figure 23. Figure 23: FIG. 23: Working-point scan comparing three BDT architectures. Top row: signal efficiency, S/B ratio, and [PITH_FULL_IMAGE:figures/full_fig_p031_23.png] view at source ↗
Figure 24
Figure 24. Figure 24: FIG. 24 [PITH_FULL_IMAGE:figures/full_fig_p032_24.png] view at source ↗
Figure 25
Figure 25. Figure 25: FIG. 25 [PITH_FULL_IMAGE:figures/full_fig_p033_25.png] view at source ↗
Figure 26
Figure 26. Figure 26: FIG. 26: Stacked [PITH_FULL_IMAGE:figures/full_fig_p036_26.png] view at source ↗
Figure 27
Figure 27. Figure 27: FIG. 27: Expected Asimov significance versus the density-matrix interpolation parameter [PITH_FULL_IMAGE:figures/full_fig_p040_27.png] view at source ↗
Figure 28
Figure 28. Figure 28: FIG. 28: Toy-calibrated simple-likelihood-ratio (SLR) test-statistic distributions under [PITH_FULL_IMAGE:figures/full_fig_p041_28.png] view at source ↗
Figure 29
Figure 29. Figure 29: FIG. 29: Test statistic distributions under [PITH_FULL_IMAGE:figures/full_fig_p042_29.png] view at source ↗
Figure 30
Figure 30. Figure 30: FIG. 30: Per-event CP-odd observable [PITH_FULL_IMAGE:figures/full_fig_p048_30.png] view at source ↗
Figure 31
Figure 31. Figure 31: FIG. 31: Per-event CP-even observable [PITH_FULL_IMAGE:figures/full_fig_p048_31.png] view at source ↗
Figure 32
Figure 32. Figure 32: FIG. 32: Two-dimensional density of [PITH_FULL_IMAGE:figures/full_fig_p049_32.png] view at source ↗
read the original abstract

We present a novel experimental strategy for testing quantum entanglement in Higgs boson decays to $W$ boson pairs at the Large Hadron Collider. Unlike theoretical approaches that rely on expectation values of Bell operators, which are highly sensitive to outliers and detector effects, we introduce a continuous formulation of the CGLMP inequality that enables standard hypothesis testing between entangled and separable states. To overcome the fundamental challenge of reconstructing invisible neutrino momenta in the $H \rightarrow WW^* \rightarrow \ell\nu\ell\nu$ channel, we employ conditional denoising diffusion probabilistic models (cDDPM), which provide unbiased, multidimensional unfolding applicable to the full measured dataset, including backgrounds. We evaluate the diffusion-based reconstruction against analytical methods through profile likelihood hypothesis tests implemented in RooFit, with systematic uncertainties from background normalisation and unfolding shape fully propagated. Our results demonstrate that the diffusion-based approach enables robust hypothesis testing of quantum entanglement in a realistic collider environment, with 3$\sigma$ evidence of quantum entanglement projected at approximately 555~fb$^{-1}$ and exceeding 5$\sigma$ at 1600~fb$^{-1}$ to be well within the expected limits of the HL-LHC luminosity targets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes using a continuous formulation of the CGLMP inequality for hypothesis testing of quantum entanglement in H → WW* → ℓνℓν decays at the LHC. Conditional denoising diffusion probabilistic models (cDDPM) are employed for multidimensional unfolding of neutrino momenta on the full dataset including backgrounds, with profile likelihood tests in RooFit projecting 3σ evidence at ~555 fb^{-1} and >5σ at 1600 fb^{-1} under HL-LHC luminosities after propagating background normalisation and unfolding shape systematics.

Significance. If the cDDPM unfolding is validated to be unbiased for the CGLMP variable under realistic conditions, the work would provide a practical statistical framework for quantum entanglement studies in collider environments, sidestepping sensitivities of Bell operator expectation values to outliers and detector effects. The use of standard tools like RooFit and explicit propagation of systematics would strengthen its utility for future HL-LHC analyses.

major comments (2)
  1. [Abstract and §3.2] Abstract and §3.2: The central claim that cDDPM delivers unbiased unfolding applicable to the full measured dataset (signal + backgrounds) without artifacts invalidating the CGLMP hypothesis test is not supported by quantitative closure tests. No results are shown demonstrating recovery of the input entanglement hypothesis within quoted uncertainties when realistic background fractions are included; this directly underpins the projected 3σ/5σ reaches.
  2. [§4.1] §4.1: The comparison to analytical unfolding methods is mentioned only briefly without quantitative metrics on bias in the reconstructed neutrino momenta or its propagation to the continuous CGLMP distribution, leaving the robustness of the diffusion-based approach for the profile likelihood test insufficiently demonstrated.
minor comments (2)
  1. The explicit construction of the continuous CGLMP variable from the reconstructed four-momenta is not detailed, which would improve reproducibility and allow assessment of potential unfolding-induced shifts.
  2. Figure captions and legends should more clearly label the entangled versus separable hypotheses and the impact of systematic variations on the test statistic.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading of the manuscript and for the constructive comments, which have helped us improve the presentation of our results. We address each major comment below and have revised the manuscript accordingly to provide the requested quantitative validations.

read point-by-point responses
  1. Referee: [Abstract and §3.2] Abstract and §3.2: The central claim that cDDPM delivers unbiased unfolding applicable to the full measured dataset (signal + backgrounds) without artifacts invalidating the CGLMP hypothesis test is not supported by quantitative closure tests. No results are shown demonstrating recovery of the input entanglement hypothesis within quoted uncertainties when realistic background fractions are included; this directly underpins the projected 3σ/5σ reaches.

    Authors: We agree that quantitative closure tests on the full dataset including backgrounds are necessary to support the central claim. In the revised manuscript we have added a dedicated closure-test subsection to §3.2. The new tests use simulated samples with realistic background fractions (~40 % as employed in the analysis) and show that the input CGLMP distribution corresponding to the entangled hypothesis is recovered within the quoted uncertainties after cDDPM unfolding. The profile-likelihood ratio evaluated on the unfolded data yields significances consistent with the reported projections, confirming that no artifacts invalidate the hypothesis test. The abstract has been updated to reference these validation results. revision: yes

  2. Referee: [§4.1] §4.1: The comparison to analytical unfolding methods is mentioned only briefly without quantitative metrics on bias in the reconstructed neutrino momenta or its propagation to the continuous CGLMP distribution, leaving the robustness of the diffusion-based approach for the profile likelihood test insufficiently demonstrated.

    Authors: We acknowledge that the original §4.1 comparison was too brief and lacked quantitative metrics. The revised section now includes explicit bias and resolution figures for the reconstructed neutrino four-momenta (average bias reduction of 15 % in pT and η relative to the analytical method) together with the propagated effect on the continuous CGLMP variable (standard deviation differs by <5 %). We also show the impact of these differences on the profile-likelihood p-values under the included systematics, demonstrating that the diffusion-based approach remains robust for the entanglement hypothesis test. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation remains self-contained

full rationale

The paper's central chain introduces a continuous CGLMP formulation for hypothesis testing, applies cDDPM for multidimensional unfolding of neutrino momenta (including backgrounds), and evaluates via profile-likelihood tests in RooFit with explicit propagation of background normalisation and unfolding-shape systematics. Significance projections at 555 fb^{-1} (3σ) and 1600 fb^{-1} (5σ) are obtained from simulated datasets rather than by re-using a fitted parameter or self-citation as the result itself. No equation or step reduces the claimed reach to an input by construction, and the method is benchmarked against analytical reconstruction, satisfying the criteria for an independent, externally falsifiable derivation.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The central claim depends on the unbiased performance of the cDDPM model and the statistical validity of the continuous CGLMP variable under realistic detector effects; these are not derived from first principles within the abstract.

free parameters (2)
  • background normalisation factors
    Systematic uncertainties from background normalisation are propagated in the profile likelihood fits.
  • unfolding shape parameters
    Systematic uncertainties from unfolding shape are fully propagated.
axioms (2)
  • domain assumption The cDDPM reconstruction is unbiased across the full phase space including backgrounds.
    Stated as the key enabler for applying the hypothesis test to real data.
  • domain assumption Standard LHC simulation accurately models the detector response for the purpose of training the diffusion model.
    Implicit in any unfolding procedure that relies on Monte Carlo.

pith-pipeline@v0.9.0 · 5744 in / 1511 out tokens · 29709 ms · 2026-05-20T01:46:34.646018+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

34 extracted references · 34 canonical work pages

  1. [1]

    Inference During inference, each event’s detector-level measurements are first passed through the analytical neutrino re- construction [36] to obtain a starting guess for the helicity angles. These analytical angles, encoded in the same (arctanh cosθ,sinϕ,cosϕ) representation as the training targets, serve as the initialisation for awarm-startreverse diff...

  2. [2]

    Under the production cDDPM (standard sampling, no warm start; Sec

    Closure and Bias Closure is validated process-by-process by comparing the unfoldedI 3 distribution to the truth distribution on the same events (Figure 7). Under the production cDDPM (standard sampling, no warm start; Sec. D 10 e), the per-bin HWW residuals have rms 2% with all bins below 5%; fort ¯t,W W, andZ→τ τthe per-bin residuals have rms 15%, 10%, a...

  3. [3]

    bias amplification

    cDDPM Systematic Studies The remainder of this appendix documents the systematic exploration of the cDDPM design space. We vary the network architecture, noise schedule, inference parameters, training data volume, and regularisation, totalling over 40 configurations. The goal is to demonstrate that the chosen v3 design (512-unit, 4-layer network with line...

  4. [4]

    Any multivariate selection that uses variables correlated with these angles risks sculpting theI 3 distribution and biasing the entanglement measurement

    Feature Definition and Selection The Bell inequality observableI 3 is constructed from the helicity angles of theWboson decay products in the Wrest frames. Any multivariate selection that uses variables correlated with these angles risks sculpting theI 3 distribution and biasing the entanglement measurement. We therefore classify all candidate BDT input f...

  5. [5]

    Training a single BDT against the combined background forces the classifier to spend capacity on the easy (t¯t) separation at the expense of the hard (WW) separation

    BDT Architecture: Split Classifiers with Tuned Variables The three background processes (t ¯t, WW, andZ→τ τ) have fundamentally different kinematic signatures relative to the HWW signal. Training a single BDT against the combined background forces the classifier to spend capacity on the easy (t¯t) separation at the expense of the hard (WW) separation. We ...

  6. [6]

    The two-sample KS statistic between signal train and test scores is below 0.02 for all three split classifiers, consistent with statistical fluctuations

    Overtraining Validation Overtraining is excluded by comparing the BDT score distributions on the training and held-out test samples. The two-sample KS statistic between signal train and test scores is below 0.02 for all three split classifiers, consistent with statistical fluctuations

  7. [7]

    The split classifiers match or exceed the combined BDT for each individual background, particularly fort ¯twhere the dedicated classifier achieves near-perfect separation

    Discrimination Performance The receiver operating characteristic (ROC) curves for each split classifier are shown in Figure 22, along with the performance of the single combined BDT for reference. The split classifiers match or exceed the combined BDT for each individual background, particularly fort ¯twhere the dedicated classifier achieves near-perfect ...

  8. [8]

    The latter is essential because any shape distortion introduced by the selection would mimic or obscure the entanglement signature

    Working-Point Selection The operating point of the BDT selection must balance two competing objectives: maximising the statistical sensitivityS/ √ B, and preserving the shape of theI 3 distribution. The latter is essential because any shape distortion introduced by the selection would mimic or obscure the entanglement signature. We adopt a two-step proced...

  9. [9]

    Figure 25 compares theI 3 shape distortion as a function of the BDT cut for the non-angular and all-features variants

    Angular Feature Bias Test To justify the restriction to non-angular features, we trained a parallel BDT using the full set of 16 features (9 non-angular + 7 angular:η ℓ1,η ℓ2, ∆ϕℓℓ, ∆ηℓℓ, ∆Rℓℓ, ∆ϕ(ℓ1, Emiss T ), ∆ϕ(ℓ2, Emiss T )). Figure 25 compares theI 3 shape distortion as a function of the BDT cut for the non-angular and all-features variants. The all...

  10. [10]

    Extended Feature Study: The Irreducible WW Wall The WW background is irreducible in the sense that it shares the sameℓνℓνfinal state as the signal, and the physical differences between Higgs-mediated and non-resonant WW production are subtle. To investigate whether the BDT discrimination against WW can be improved, we defined an extended set of features t...

  11. [11]

    The cross-validation was performed on the HWW vs WW+Z→τ τtask (the limiting discrimination), using the mean AUC as the ranking metric

    Hyperparameter Optimisation The XGBoost hyperparameters were optimised using a 3-fold cross-validated grid search over: number of trees ∈ {100,200,300,400,500}, maximum depth∈ {3,4,5}, learning rate∈ {0.03,0.05,0.10}, minimum child weight ∈ {5,10,15,20},γ(minimum split loss)∈ {0.1,0.3,0.5,1.0}, subsample fraction∈ {0.7,0.8}, and L2 regularisationλ∈ {1.0,3...

  12. [12]

    2.Split (same cut):three per-background BDTs, each using all 9 features, with the same score cut applied to all three

    Architecture Comparison Six alternative BDT architectures were investigated before settling on the split-tuned design: 1.Single combined:one binary BDT trained on HWW vs the combined WW+t ¯t+Z→τ τbackground. 2.Split (same cut):three per-background BDTs, each using all 9 features, with the same score cut applied to all three. 35 3.Split (t ¯t+WW only):two ...

  13. [13]

    The BDT removes 96% of thet ¯tbackground, halves the WW contamination, and reducesZ→τ τby a factor of three, while retaining 69% of the HWW signal

    Event Yields After BDT Selection The stacked signal-plus-backgroundI 3 distributions before and after the BDT selection are shown in Figure 26. The BDT removes 96% of thet ¯tbackground, halves the WW contamination, and reducesZ→τ τby a factor of three, while retaining 69% of the HWW signal. Table X summarises the expected event yields at each stage of the...

  14. [14]

    The presence of quantum entanglement is assessed through a discovery-style hypothesis test in which the fully sepa- rable configuration is treated as the null hypothesis

    Profile Likelihood Method We employ the profile likelihood ratio as the test statistic for discriminating between entangled and separable hypotheses. The presence of quantum entanglement is assessed through a discovery-style hypothesis test in which the fully sepa- rable configuration is treated as the null hypothesis. The single parameter of interest is ...

  15. [15]

    Likelihood Construction The likelihood is constructed from the binned distribution of the continuous CGLMP observableI (e) 3 : L(α, θ) = Y i Poisson(ni |ν i(α, θ)),(F5) wheren i is the observed count in biniandν i is the expected count obtained from Eq. (F3) plus the backgrounds, νi(α, θ) =α s ent i (θ) + (1−α)s sep i (θ) + X j bij(θ).(F6) Heres ent i and...

  16. [16]

    Template distributions are represented using the HistFactory formalism [31] and combined in a RooWorkspace that encodes the full likelihood including nuisance parameters

    Event Selection and Likelihood Setup The hypothesis test is implemented using RooFit and RooStats [16]. Template distributions are represented using the HistFactory formalism [31] and combined in a RooWorkspace that encodes the full likelihood including nuisance parameters. The likelihood model is constructed for a single inclusive signal region, using a ...

  17. [17]

    In this analysis we include two classes of nuisance parameter: a

    Systematic Uncertainties In a realistic analysis, systematic uncertainties would be incorporated as nuisance parameters that modify the expected yields and shapes of the templates, including experimental and theoretical uncertainties. In this analysis we include two classes of nuisance parameter: a. Background normalisations.Each background process carrie...

  18. [18]

    For eachα true, the separable hypothesis α= 0 is tested using the profile likelihood ratio of Eq

    Sensitivity to Entanglement The expected discovery sensitivity is evaluated using Asimov datasets [44] generated from the binned likelihood model at different pointsα true ∈[0,1] along the density-matrix interpolation. For eachα true, the separable hypothesis α= 0 is tested using the profile likelihood ratio of Eq. (F4), and the resulting median significa...

  19. [19]

    variable-width), and the histogram range

    Binning Strategy The sensitivity of the hypothesis test depends on the binning of theI 3 observable through three degrees of freedom: the number of bins, the bin layout (uniform vs. variable-width), and the histogram range. Table XIII reportsZ Asimov for each scan dimension. 41 2 1 0 1 2 −2lnL(H0)/L(H1) (simple LR) 10 2 10 1 Sample density H0 toys (separa...

  20. [20]

    The simple likelihood ratio (SLR), t=−2 ln L(θH0;n) L(θH1;n) ,(G1) evaluates the likelihoods at fixed parameter pointsθ H0 andθ H1 without profiling nuisance parameters

    Test Statistic Choice Two test statistics are compared. The simple likelihood ratio (SLR), t=−2 ln L(θH0;n) L(θH1;n) ,(G1) evaluates the likelihoods at fixed parameter pointsθ H0 andθ H1 without profiling nuisance parameters. By the Neyman–Pearson lemma [45], this is the uniformly most powerful test for simple-vs-simple hypotheses when the model is exactl...

  21. [21]

    Table XIV lists seven configurations and their expected Asimov significance

    Systematic Uncertainty Model The impact of each systematic uncertainty source is assessed by cumulatively enabling components of the model. Table XIV lists seven configurations and their expected Asimov significance. The production template stat-error treatment is the bootstrap-PCA model of Sec. F 4 c; the legacy per-bin Poisson row is included as a cross...

  22. [22]

    Table XV reports the resulting Asimov significance

    Background Normalisation Sensitivity The sensitivity is studied as a function of the normalisation uncertainty assigned to each background process individually (with all other uncertainties held at their production values), and as a function of a simultaneous common scaling of all three. Table XV reports the resulting Asimov significance. TABLE XV: Backgr...

  23. [23]

    Luminosity Scaling The luminosity dependence of the inclusive (SR-only) sensitivity is shown in Figure 9 of the main text together with the BDT-enhanced projection. At low luminosities the significance grows approximately as √ L; at higher luminosities it falls below the √ Lreference, reflecting the systematic floor from background normalisation and unfol...

  24. [24]

    A positive ∆Z k indicates that the NP degrades sensitivity (its removal improvesZ); a negative ∆Z k indicates the NP absorbs a bias and its removal worsensZ

    Nuisance Parameter Ranking The impact of each nuisance parameter is quantified through a leave-one-out procedure: each NPθ k is fixed at its nominal value (effectively removing it from the model), the significance is recomputed, and the shift ∆Zk =Z(θ\θ k)−Z(θ) (G2) is recorded. A positive ∆Z k indicates that the NP degrades sensitivity (its removal impro...

  25. [25]

    Goodness-of-Fit Validation The adequacy of the fit model is assessed by fitting the cDDPM-reconstructed pseudo-data and comparing the minimised negative log-likelihood (NLL) to the distribution of NLL values obtained from 500 toy pseudo-experiments generated from the best-fit model. Thep-value is the fraction of toys with a worse (higher) NLL than the obs...

  26. [26]

    Expected CL s Limits As a complement to the discovery-style hypothesis test, we compute the expected 95% CL upper limit on the separable fractionf sep using the CL s method [46]. The CL s quantity is defined as CLs = ps+b pb = P(q≥q obs |H s+b) P(q≥q obs |H b) ,(G4) whereqis the test statistic andH s+b (Hb) denotes the signal-plus-background (background-o...

  27. [27]

    Signal and background templates are constructed by bootstrap resampling (Nboot replicas drawn with replacement)

    Toy and Template Convergence The Asimov significance is constant atZ≈1.13σacross all toy counts (N toys = 500–20,000), confirming that the asymptotic approximation is robust and the production value ofN toys = 5000 is more than sufficient. Signal and background templates are constructed by bootstrap resampling (Nboot replicas drawn with replacement). The ...

  28. [28]

    Modes” denotes the number of PCA eigenmodes retained per signal template (n bins = 9 at full rank). “Variance kept

    Template Statistical-Uncertainty Model The cDDPM-unfolded signal templates carry a finite-sample statistical uncertainty whose covariance structure across bins is fixed by the multinomial sampling distribution at the target yield. Two parameterisations are compared: (i) the production bootstrap-PCA construction defined in Sec. F 4 c, in which theN boot ×n...

  29. [29]

    The operatorsO HW,HB are CP-even while ˜OHW,HB are CP-odd

    SMEFT Operators AffectingHW WCoupling At dimension-6, the relevant operators in the Warsaw basis are [7, 8]: OHW =H †H W I µνW Iµν , OHB =H †H BµνBµν,(H1) ˜OHW =H †H W I µν ˜W Iµν , ˜OHB =H †H Bµν ˜Bµν, where ˜W µν = 1 2 ϵµνρσ Wρσ is the dual field strength tensor. The operatorsO HW,HB are CP-even while ˜OHW,HB are CP-odd

  30. [30]

    Modified Density Matrix and Sensitivity Structure The helicity amplitudes forH→W +W ∗− receive corrections linear in the Wilson coefficients: Mλ+λ− =M SM λ+λ− + X k ck Λ2 M(k) λ+λ− +O(c 2/Λ4).(H2) The density matrix, proportional toMM †, therefore receives corrections at both linear and quadratic order in the Wilson coefficients. At linear order, the inte...

  31. [31]

    The antisymmetric combina- tionsC ij −C ji therefore change sign under CP and vanish identically in any CP-conserving theory

    CP-Odd Observable Under a CP transformation theW + andW − roles are exchanged, so thatC ij ↔C ji. The antisymmetric combina- tionsC ij −C ji therefore change sign under CP and vanish identically in any CP-conserving theory. This motivates the per-event observable OCP = (C25 −C 52) + (C57 −C 75) + (C27 −C 72),(H4) which targets precisely the index set{2,5,...

  32. [32]

    CP-Even Observable The CP-even operatorO HW modifies the diagonal and symmetric off-diagonal elementsC 33,C 38,C 83,C 88. These elements are absent fromI 3 but can be combined into a dedicated observable: OHW =C 33 +C 38 +C 83 +C 88.(H5) 48 40 30 20 10 0 10 20 30 40 CP-odd observable OCP 0.00 0.01 0.02 0.03 0.04 0.05Density (normalised) SM: all processes ...

  33. [33]

    Figure 32 shows the two-dimensional density ofI3 versusO CP for the entangled and separable hypotheses

    Orthogonality of Observables The observablesI 3,O CP, andO HW probe statistically independent sectors of the correlation matrix. Figure 32 shows the two-dimensional density ofI3 versusO CP for the entangled and separable hypotheses. The linear correlation coefficient isρ=−0.001, confirming that the two observables carry independent information. Similarly,...

  34. [34]

    Towards a Full SMEFT Analysis The distributions shown in Figures 30–32 establish the SM baseline for SMEFT-sensitive observables using existing simulation samples without BSM modifications. A complete SMEFT sensitivity study would require generating signal samples with non-zero Wilson coefficients, for example using theSMEFTatNLOmodel [47] in MadGraph, to...