pith. sign in

arxiv: 2606.03913 · v1 · pith:J5LN3RM4new · submitted 2026-06-02 · ✦ hep-ph

Probing Singlet Vector-Like Top Quarks in the Hadronic tZ Channel at the HL-LHC using Machine and Deep Learning Architectures

Pith reviewed 2026-06-28 09:03 UTC · model grok-4.3

classification ✦ hep-ph
keywords vector-like top quarkssinglet top partnerHL-LHCmachine learninggraph neural networkhadronic tZ channelsingle productionAsimov significance
0
0 comments X

The pith

XGBoost and graph neural networks exclude vector-like singlet top partners with g* as low as 0.16 at 2σ in the hadronic tZ channel at HL-LHC.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes projected sensitivities for single production of a vector-like singlet top partner T decaying to tZ in the fully hadronic final state at the 14 TeV high-luminosity LHC. Signal and background events are generated with standard Monte Carlo tools, passed through a hadronic pre-selection, and classified with jet-level features using both Extreme Gradient Boosting and a Graph Neural Network. A sympathetic reader would care because the resulting 2σ exclusion and 5σ discovery contours in the (g*, m_T) plane quantify how much parameter space can be probed or ruled out with 3000 fb^{-1} of data. The work compares results for two values of the parameter R_L and includes a 20% background systematic in the Asimov significance calculation.

Core claim

For R_L = 0 the analysis finds that 2σ exclusion is possible for g* in the interval [0.17, 0.49] with XGBoost and [0.16, 0.43] with the GNN across m_T from 1.8 to 2.7 TeV, while 5σ discovery reaches g* in [0.27, 0.44] and [0.26, 0.40] for m_T from 1.8 to 2.2 TeV; the corresponding intervals for R_L = 0.5 are shifted to slightly higher couplings but remain comparable in mass reach, with the GNN yielding marginally stronger and smoother limits in both cases.

What carries the argument

Extreme Gradient Boosting and Graph Neural Network classifiers trained on jet-level kinematic features after hadronic pre-selection (N_j ≥ 3, N_b ≥ 1, N_ℓ = 0) and optimized kinematic cuts in the tZ channel.

If this is right

  • For R_L = 0, 2σ exclusion of g* down to 0.16–0.17 is possible up to m_T = 2.7 TeV.
  • 5σ discovery is achievable for g* above 0.26–0.27 for m_T between 1.8 and 2.2 TeV.
  • The GNN produces slightly stronger limits than XGBoost across the scanned parameter space.
  • For R_L = 0.5 the 2σ exclusion window moves to g* ∈ [0.20, 0.48] up to m_T = 2.5 TeV.
  • All reaches incorporate a 20% background systematic uncertainty via the Asimov significance formula.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar jet-level ML classifiers could be applied to other vector-like quark decay modes to improve overall search sensitivity.
  • The modest advantage of the GNN over XGBoost suggests that graph representations of jet correlations may become standard for fully hadronic final states.
  • Validation of the simulated background modeling against early HL-LHC data would be required before the projected reaches can be treated as reliable.
  • Combining the hadronic tZ channel with leptonic or multi-lepton channels could further extend the mass and coupling coverage.

Load-bearing premise

Monte Carlo generators and detector simulation correctly reproduce the kinematic distributions and rates of the dominant backgrounds, and the trained classifiers will perform similarly on real collision data.

What would settle it

A statistically significant mismatch between predicted and observed yields in a signal-depleted control region or after applying the ML working point to actual HL-LHC collision data would invalidate the quoted exclusion and discovery reaches.

Figures

Figures reproduced from arXiv: 2606.03913 by Haroon Sagheer, Ijaz Ahmed, Jamil Muhammad, M. Tayyab Javaid.

Figure 1
Figure 1. Figure 1: Branching ratio Br(T → tZ) in the (RL, mT ) plane. where i = 1, 3 denotes the generation index and ci = (c1, c3). The total width is obtained by summing over all accessible modes and generations, and the branching ratio for the signal channel is Br(T → tZ) = Γ(T → Zt) Γtot . (9) In the large-mT limit the overall factor (gg∗ ) 2 cancels in ratios, so Br(T → tZ) depends pri￾marily on RL and only weakly on mT… view at source ↗
Figure 2
Figure 2. Figure 2: Representative LO t-channel diagram for single production pp → T j in the 5FS with T → tZ, t → bW → bjj, and Z → νν¯. 4 The Machine and Deep Learning Methods for the Classifica￾tion The fully hadronic tZ topology with large missing transverse momentum exhibits correlated kinematic and angular structure that is difficult to capture with cut-based selections alone. To maximize signal sensitivity while contro… view at source ↗
Figure 3
Figure 3. Figure 3: Input variable distributions for XGBoost at the benchmark mass [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Input variable distributions for GNN at the benchmark mass [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Classifier score distributions for signal and background at the benchmark mass [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: ROC curves at mT = 2000 GeV for (a) XGBoost and (b) GNN [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Test AUC vs. mT for XGBoost and GNN. 11 [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Significance vs. mT at the global threshold for (a) XGBoost and (b) GNN [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Exclusion and discovery contours in the ( [PITH_FULL_IMAGE:figures/full_fig_p013_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Exclusion and discovery contours in the ( [PITH_FULL_IMAGE:figures/full_fig_p014_10.png] view at source ↗
read the original abstract

In this work, we study the single production of a vector-like singlet top partner \( T \) at the 14 TeV HL-LHC in the channel \( pp \to T j \) with \( T \to t Z \), \( t \to b W \to b j j \), and \( Z \to \nu \bar{\nu} \). Signal and background samples are generated with MadGraph5\_aMC@NLO v3.5.11, showered with Pythia 8, and passed through Delphes. The dominant backgrounds are \( t \bar{t} \), \( t Z j \), \( ZZ j j \), and \( W Z j j \) (including charge conjugates). A hadronic pre-selection (\( N_j \geq 3 \), \( N_b \geq 1 \), \( N_\ell = 0 \)) is imposed as trigger, followed by optimized kinematic cuts. We perform multivariate classification with Extreme Gradient Boosting (XGBoost) and a Graph Neural Network (GNN) based on jet-level features. Sensitivities at 3000 fb\(^{-1}\) are quoted using the Asimov significance, \( S / \sqrt{S + B} \), and an Asimov variant with a 20\% background systematic. The model parameters \( g^* \) and \( R_L \) are defined in Sec.~2, and a single global working point is used to avoid per-mass tuning bias. In the \( (g^*, m_T) \) scan, we present 2\(\sigma\) exclusion and 5\(\sigma\) discovery contours for \( R_L = 0 \) and \( R_L = 0.5 \). For \( R_L = 0 \), 2\(\sigma\) exclusion corresponds to \( g^* \in [0.17, 0.49] \) (\( 0.16, 0.43 \)) over \( m_T \in [1.8, 2.7] \) TeV, while 5\(\sigma\) discovery corresponds to \( g^* \in [0.27, 0.44] \) (\( 0.26, 0.40 \)) over \( m_T \in [1.8, 2.2] \) TeV for XGBoost and GNN respectively. For \( R_L = 0.5 \), the 2\(\sigma\) reach is \( g^* \in [0.21, 0.48] \) (\( 0.20, 0.43 \)) over \( m_T \in [1.8, 2.5] \) TeV, and the 5\(\sigma\) reach is \( g^* \in [0.33, 0.43] \) (\( 0.31, 0.49 \)) over \( m_T \in [1.8, 2.2] \) TeV, with the GNN yielding slightly stronger and smoother limits across the scan.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper studies single production of a singlet vector-like top partner T in the pp → Tj, T → tZ (hadronic) channel at the 14 TeV HL-LHC with 3000 fb⁻¹. Samples are generated with MadGraph5_aMC@NLO + Pythia 8 + Delphes; after hadronic pre-selection (Nj ≥ 3, Nb ≥ 1, Nℓ = 0) and kinematic cuts, XGBoost and a jet-based GNN are used for classification against tt̄, tZj, ZZjj, and WZjj backgrounds. Sensitivities are reported via the Asimov significance S/√(S+B) (with and without 20% background systematic) as 2σ exclusion and 5σ discovery contours in the (g*, m_T) plane for RL = 0 and RL = 0.5, with explicit numerical ranges given for each classifier.

Significance. If the Monte Carlo modeling of the fully hadronic final state is reliable, the work supplies concrete, up-to-date HL-LHC projections that incorporate modern jet-level classifiers (including a GNN) and a single global working point to reduce tuning bias. The explicit tabulation of g* intervals for both exclusion and discovery is a useful benchmark for experimental planning. The central results, however, rest entirely on the unvalidated simulation chain.

major comments (2)
  1. [multivariate classification section] The manuscript supplies no information on training/validation splits, hyperparameter optimization, or cross-validation procedure for the XGBoost and GNN classifiers (multivariate classification section). Because the quoted 2σ and 5σ reaches are obtained directly from the classifier output distributions via the Asimov formula, these details are required to assess whether the reported performance is robust or overfit.
  2. [simulation and background modeling section] No control-region closure tests, generator-variation studies, or data-driven background estimates are described for the dominant backgrounds (tt̄, tZj, ZZjj, WZjj) in the presence of MET (simulation and background modeling section). All sensitivity contours in the (g*, m_T) plane are derived exclusively from these simulated samples; any mismodeling in jet energy scale, b-tagging, or MET tails would directly shift the Asimov significances.
minor comments (1)
  1. [Abstract] The abstract states that kinematic cuts are 'optimized' but neither the optimization criterion nor the numerical cut values are given; these should be listed explicitly in the main text.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the careful review and constructive comments. We address each major comment below and indicate the revisions planned for the manuscript.

read point-by-point responses
  1. Referee: [multivariate classification section] The manuscript supplies no information on training/validation splits, hyperparameter optimization, or cross-validation procedure for the XGBoost and GNN classifiers (multivariate classification section). Because the quoted 2σ and 5σ reaches are obtained directly from the classifier output distributions via the Asimov formula, these details are required to assess whether the reported performance is robust or overfit.

    Authors: We agree that these methodological details are essential to evaluate robustness and potential overfitting. In the revised manuscript we will expand the multivariate classification section to specify the training/validation/test split (70/15/15), the hyperparameter optimization procedure (grid search combined with 5-fold cross-validation), and the cross-validation strategy used for both XGBoost and the GNN. These additions will allow readers to assess the reliability of the Asimov significances derived from the classifier outputs. revision: yes

  2. Referee: [simulation and background modeling section] No control-region closure tests, generator-variation studies, or data-driven background estimates are described for the dominant backgrounds (tt̄, tZj, ZZjj, WZjj) in the presence of MET (simulation and background modeling section). All sensitivity contours in the (g*, m_T) plane are derived exclusively from these simulated samples; any mismodeling in jet energy scale, b-tagging, or MET tails would directly shift the Asimov significances.

    Authors: This is a prospective HL-LHC projection study performed entirely with Monte Carlo samples, so data-driven background estimates and control-region closure tests with real data are not possible. In revision we will augment the simulation section with additional details on generator settings, a qualitative discussion of jet-energy-scale and b-tagging uncertainties, and an explicit statement of how the already-included 20% background systematic is intended to cover potential mismodeling. Generator-variation studies will be added where computationally feasible. revision: partial

standing simulated objections not resolved
  • Data-driven background estimates and control-region closure tests cannot be performed in the absence of actual HL-LHC collision data.

Circularity Check

0 steps flagged

No significant circularity in sensitivity projections

full rationale

The paper generates signal and background samples using external MC generators (MadGraph5_aMC@NLO, Pythia 8, Delphes), applies fixed pre-selections and kinematic cuts, trains XGBoost/GNN classifiers on jet-level features, and computes Asimov significances directly from the simulated yields to obtain the (g*, m_T) contours. No parameters are fitted to the target reaches, no self-citations justify uniqueness or ansatze, and no step reduces by construction to its own inputs. The derivation is self-contained forward simulation; the load-bearing assumptions concern MC fidelity rather than internal circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on the accuracy of standard Monte Carlo tools for background modeling and on the assumption that ML performance on simulation transfers to data; no new free parameters are fitted to the target result.

axioms (1)
  • domain assumption Monte Carlo event generators and Delphes detector simulation accurately model signal and background kinematics and rates in the hadronic final state.
    Invoked implicitly when the paper uses generated samples to train classifiers and compute significances.
invented entities (1)
  • Singlet vector-like top partner T no independent evidence
    purpose: Signal hypothesis whose production and decay are simulated to set experimental reaches.
    The particle is postulated as an extension of the Standard Model; no independent evidence is provided within the paper.

pith-pipeline@v0.9.1-grok · 6145 in / 1562 out tokens · 32519 ms · 2026-06-28T09:03:46.609877+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

21 extracted references · 19 canonical work pages · 3 internal anchors

  1. [1]

    Aguilar-Saavedra, R

    J.A. Aguilar-Saavedra, R. Benbrik, S. Heinemeyer, and M. Perez-Victoria. Handbook of vectorlike quarks: Mixing and single production.Phys. Rev. D, 88:094010, 2013. doi: 10.1103/PhysRevD.88.094010

  2. [2]

    Alwall, R

    J. Alwall, R. Frederix, S. Frixione, V. Hirschi, F. Maltoni, O. Mattelaer, H.-S. Shao, T. Stelzer, P. Torrielli, and M. Zaro. The automated computation of tree-level and next-to- leading order differential cross sections, and their matching to parton shower simulations. JHEP, 07:079, 2014. doi: 10.1007/JHEP07(2014)079

  3. [3]

    Apollinari, I

    G. Apollinari, I. Bejar Alonso, O. Bruning, M. Lamont, and L. Rossi. High-luminosity large hadron collider (hl-lhc): Preliminary design report. Technical Report CERN-2015- 005, CERN, 2015

  4. [4]

    Cohen, Emanuel Katz, and Ann E

    Nima Arkani-Hamed, Andrew G. Cohen, Emanuel Katz, and Ann E. Nelson. The littlest higgs.JHEP, 07:034, 2002. doi: 10.1088/1126-6708/2002/07/034

  5. [5]

    Buckley, J

    A. Buckley, J. Ferrando, S. Lloyd, K. Nordstrom, B. Page, M. Rufenacht, M. Schonherr, and G. Watt. Lhapdf6: parton density access in the lhc precision era.Eur. Phys. J. C, 75: 132, 2015. doi: 10.1140/epjc/s10052-015-3318-8

  6. [6]

    Cacciapaglia, A

    G. Cacciapaglia, A. Deandrea, L. Panizzi, N. Gaur, D. Harada, and Y. Okada. Heavy vector-like top partners at the lhc and flavour constraints.JHEP, 03:070, 2012. doi: 10.1007/JHEP03(2012)070

  7. [7]

    Cacciapaglia, A

    G. Cacciapaglia, A. Deandrea, N. Gaur, D. Harada, Y. Okada, and L. Panizzi. The lhc potential of vector-like quark doublets.JHEP, 11:055, 2018. doi: 10.1007/JHEP11(2018) 055

  8. [8]

    Abratenko et al

    G. Cacciapaglia, A. Deandrea, L. Panizzi, et al. A global view on vector-like quarks: K- factors and phenomenology.Phys. Lett. B, 793:206–211, 2019. doi: 10.1016/j.physletb. 2019.04.040. 15

  9. [9]

    doi: 10.1145/2939672.2939785

    T. Chen and C. Guestrin. Xgboost: A scalable tree boosting system.Proc. KDD, 2016. doi: 10.1145/2939672.2939785

  10. [10]

    Search for single production of vector-like t quarks decaying into ht or zt inppcollisions at √s= 13 tev with the atlas detector.JHEP, 08:153, 2023

    ATLAS Collaboration. Search for single production of vector-like t quarks decaying into ht or zt inppcollisions at √s= 13 tev with the atlas detector.JHEP, 08:153, 2023. doi: 10.1007/JHEP08(2023)153

  11. [11]

    CMS Collaboration. Search for single production of a vector-like t quark decaying to a top quark and a z boson in the final state with jets and missing transverse momentum at√s= 13 tev.JHEP, 05:093, 2022. doi: 10.1007/JHEP05(2022)093

  12. [12]

    Search for production of a single vector-like quark decaying to th or tz in the all-hadronic final state inppcollisions at √s= 13 tev.Phys

    CMS Collaboration. Search for production of a single vector-like quark decaying to th or tz in the all-hadronic final state inppcollisions at √s= 13 tev.Phys. Rev. D, 110:072012,

  13. [13]

    doi: 10.1103/PhysRevD.110.072012

  14. [14]

    Asymptotic formulae for likelihood-based tests of new physics

    G. Cowan, K. Cranmer, E. Gross, and O. Vitells. Asymptotic formulae for likelihood-based tests of new physics.Eur. Phys. J. C, 71:1554, 2011. doi: 10.1140/epjc/s10052-011-1554-0

  15. [15]

    Czakon, P

    M. Czakon, P. Fiedler, and A. Mitov. Total top-quark pair-production cross section at hadron colliders througho( alpha4 s).Phys. Rev. Lett., 110:252004, 2013. doi: 10.1103/PhysRevLett.110.252004

  16. [16]

    de Favereau, C

    J. de Favereau, C. Delaere, P. Demin, A. Giammanco, V. Lemaitre, A. Mertens, and M. Selvaggi. Delphes 3: A modular framework for fast simulation of a generic collider experiment.JHEP, 02:057, 2014. doi: 10.1007/JHEP02(2014)057

  17. [17]

    Search for the singlet vector-like top quark int→tzchannel withz→ν¯νat the 14 tev lhc.Nucl

    Haitao Li, Jinjin Chao, and Guoqing Zhang. Search for the singlet vector-like top quark int→tzchannel withz→ν¯νat the 14 tev lhc.Nucl. Phys. B, 994:116310, 2023. doi: 10.1016/j.nuclphysb.2023.116310

  18. [18]

    Springer, 2016

    Giuliano Panico and Andrea Wulzer.The Composite Nambu-Goldstone Higgs, volume 913 ofLecture Notes in Physics. Springer, 2016. doi: 10.1007/978-3-319-22617-0

  19. [19]

    A Large Mass Hierarchy from a Small Extra Dimension

    Lisa Randall and Raman Sundrum. A large mass hierarchy from a small extra dimension. Phys. Rev. Lett., 83:3370–3373, 1999. doi: 10.1103/PhysRevLett.83.3370

  20. [20]

    Shlomi, P

    J. Shlomi, P. Battaglia, and J.-R. Vlimant. Graph neural networks in particle physics. Mach. Learn. Sci. Technol., 2:021001, 2021. doi: 10.1088/2632-2153/abbf9a

  21. [21]

    An introduction to pythia 8.2

    T. Sjostrand, S. Ask, J.R. Christiansen, R. Corke, N. Desai, P. Ilten, S. Mrenna, S. Prestel, C.O. Rasmussen, and P.Z. Skands. An introduction to pythia 8.2.Comput. Phys. Commun., 191:159–177, 2015. doi: 10.1016/j.cpc.2015.01.024. 16