Hits to Higgs: Hit-Level Higgs Classification from Raw LHC Detector Data Using Higgsformer
Pith reviewed 2026-05-18 21:01 UTC · model grok-4.3
The pith
A transformer model distinguishes Higgs events directly from raw inner tracker hits, matching traditional reconstruction performance.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Retraining a transformer model for hit-to-track assignment to instead classify events allows distinguishing ttH from ttbar directly from raw hits, reaching an AUC of 0.855 that matches the performance of a Particle Transformer on reconstructed objects at about 40 percent b-tagging efficiency.
What carries the argument
Higgsformer, a transformer model adapted from inner-tracker hit assignment to direct classification on raw detector hits.
If this is right
- Event classification becomes possible without the intermediate steps of track finding, vertexing, or jet clustering.
- The hit-level approach maintains performance across varying dataset sizes and pileup conditions.
- Discrimination power at the raw-hit stage can equal that of object-based methods run at typical b-tagging efficiencies.
Where Pith is reading between the lines
- If the approach transfers to real data it could reduce the computing resources needed for high-level physics analyses.
- The same hit-level strategy might be tested on other rare processes or detector subsystems beyond the inner tracker.
- Combining raw hits with limited additional information could be explored as a way to improve discrimination without full reconstruction.
Load-bearing premise
The fast simulation of inner-tracker hits produces patterns representative enough of real data for the trained model to perform similarly on actual collisions.
What would settle it
Measuring the AUC on real LHC collision data and finding it significantly below 0.855 would show that the simulation does not capture the necessary features.
read the original abstract
We present Higgsformer, a transformer-based architecture that classifies Higgs events at the Large Hadron Collider directly from raw inner tracker hits, bypassing the traditional reconstruction chain of intermediate physics objects. As a benchmark, we focus on distinguishing $t\bar{t}H$ from $t\bar{t}$ events with $H \to b\bar{b}$, a particularly challenging task due to their similar final state topologies. Our pipeline begins with event generation in Pythia8, fast simulation with ACTS/Fatras, and classification directly from raw detector hits. We show for the first time that a transformer model originally developed for inner tracker hit-to-track assignment can be retrained to classify Higgs signal events directly from raw hits. For comparison, we reconstruct the same events with Delphes and train a Particle Transformer as an object-based classifier. We evaluate both approaches under varying dataset sizes and pileup levels. Despite relying exclusively on inner tracker hits, our large Higgsformer achieves an AUC of $0.855$, matching the performance of the traditional reconstruction pipeline at a $b$-tagging efficiency of $\approx 40\%$ under the same detector constraints.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces Higgsformer, a transformer architecture adapted from hit-to-track assignment tasks, to classify ttH (with H→bb) versus tt events directly from raw inner-tracker hits. Events are generated with Pythia8 and simulated with ACTS/Fatras; the model is compared to a Particle Transformer trained on Delphes-reconstructed objects. The central quantitative result is an AUC of 0.855 for the large Higgsformer, stated to match the object-based pipeline at ~40% b-tagging efficiency under identical detector constraints, with evaluations performed at varying dataset sizes and pileup levels.
Significance. If the performance generalizes beyond the fast-simulation setup, the result would demonstrate that end-to-end hit-level classification can reach parity with traditional reconstruction pipelines for a challenging final state. This would be a notable technical contribution in the direction of bypassing intermediate physics objects. The adaptation of an existing hit-to-track transformer and the direct comparison to a Delphes-based baseline are strengths; however, the absence of statistical uncertainties and simulation-fidelity checks limits the immediate impact.
major comments (2)
- [Results] Results section: the reported AUC of 0.855 is given as a single point estimate with no error bars, no dataset cardinality, and no explicit statement of the pileup multiplicity or b-tagging working point used for the final comparison. These omissions make it impossible to judge whether the quoted parity with the Delphes Particle Transformer is statistically meaningful or sensitive to the precise evaluation conditions.
- [Simulation and Methods] Simulation and detector modeling: the entire training and evaluation chain uses only ACTS/Fatras fast parametric simulation of inner-tracker hits. No cross-check against full Geant4 digitization, no quantification of differences in hit multiplicity, resolution, or multiple-scattering tails, and no discussion of how such discrepancies would propagate into the transformer’s attention patterns are provided. This is load-bearing for the claim that the model operates “directly from raw inner tracker hits” in a manner relevant to real LHC data.
minor comments (1)
- [Abstract] Abstract: the statement that performance was evaluated “under varying dataset sizes and pileup levels” is not accompanied by the actual ranges or the specific configuration that yields the quoted AUC of 0.855.
Simulated Author's Rebuttal
We thank the referee for the careful reading of our manuscript and the constructive comments, which help clarify the presentation of our results. We address each major point below and outline the corresponding revisions.
read point-by-point responses
-
Referee: [Results] Results section: the reported AUC of 0.855 is given as a single point estimate with no error bars, no dataset cardinality, and no explicit statement of the pileup multiplicity or b-tagging working point used for the final comparison. These omissions make it impossible to judge whether the quoted parity with the Delphes Particle Transformer is statistically meaningful or sensitive to the precise evaluation conditions.
Authors: We agree that the presentation of the central AUC result would benefit from additional statistical detail and explicit evaluation parameters. In the revised manuscript we will report bootstrapped uncertainties on all AUC values, state the exact training and test dataset cardinalities used for the quoted 0.855 result, and specify the pileup multiplicity together with the precise b-tagging efficiency working point at which the Delphes Particle Transformer comparison is performed. These additions will make the statistical significance of the observed parity transparent. revision: yes
-
Referee: [Simulation and Methods] Simulation and detector modeling: the entire training and evaluation chain uses only ACTS/Fatras fast parametric simulation of inner-tracker hits. No cross-check against full Geant4 digitization, no quantification of differences in hit multiplicity, resolution, or multiple-scattering tails, and no discussion of how such discrepancies would propagate into the transformer’s attention patterns are provided. This is load-bearing for the claim that the model operates “directly from raw inner tracker hits” in a manner relevant to real LHC data.
Authors: We acknowledge that reliance on fast parametric simulation constitutes a genuine limitation for immediate claims about real LHC data. The manuscript is framed as a proof-of-concept study within a controlled simulation environment, which is standard for algorithmic development. In the revision we will add a dedicated limitations subsection that (i) quantifies typical differences in hit multiplicity and resolution between Fatras and full Geant4 for the inner tracker, (ii) discusses how these differences could affect attention patterns, and (iii) clarifies that the reported performance is specific to the fast-simulation setup. We will also outline the steps required for a future full-simulation validation. This keeps the scope of the current work accurate while addressing the referee’s concern. revision: partial
Circularity Check
No circularity: empirical AUC from independent training/evaluation on simulated hits
full rationale
The paper reports an empirical performance metric (AUC 0.855) obtained by training and testing Higgsformer on Pythia8+ACTS/Fatras events and comparing it to a separately reconstructed Delphes+Particle Transformer baseline. No equations, fitted parameters, or self-citations reduce this result to an input by construction. The central claim is a direct measurement under stated simulation assumptions rather than a derivation that collapses to its own premises. Minor hyperparameter choices on the same samples do not meet the load-bearing self-definition or fitted-prediction criteria.
Axiom & Free-Parameter Ledger
free parameters (1)
- Transformer hyperparameters and training schedule
axioms (1)
- domain assumption ACTS/Fatras fast simulation produces hit-level data whose statistical properties are close enough to real detector response for the classification task.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We show for the first time that a transformer model originally developed for inner tracker hit-to-track assignment can be retrained to classify Higgs events directly from raw hits... Higgsformer achieves an AUC of 0.792.
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Higgsformer-small (only inner tracker) Raw detector hits
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
- [1]
- [2]
-
[3]
T. Sj¨ ostrand, S. Ask, J.R. Christiansen, R. Corke, N. Desai, P. Ilten et al., An introduction to PYTHIA 8.2, Comput. Phys. Commun. 191 (2015) 159 [ 1410.3012]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[4]
Spannagel et al., The A Common Tracking Software project, SoftwareX 11 (2020) 100464 [1909.06194]
S. Spannagel et al., The A Common Tracking Software project, SoftwareX 11 (2020) 100464 [1909.06194]
-
[5]
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
T. Dao, FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning, arXiv preprint arXiv:2307.08691 (2023)
work page internal anchor Pith review Pith/arXiv arXiv 2023
- [6]
-
[7]
DELPHES 3, A modular framework for fast simulation of a generic collider experiment
J. de Favereau, C. Delaere, P. Demin, A. Giammanco, V. Lemaˆ ıtre, A. Mertens et al., DELPHES 3, A modular framework for fast simulation of a generic collider experiment, JHEP 02 (2014) 057 [ 1307.6346]
work page internal anchor Pith review Pith/arXiv arXiv 2014
- [8]
-
[9]
M.e.a. Aaboud, Observation of Higgs boson production in association with a top quark pair at the LHC with the ATLAS detector, Physics Letters B 784 (2018) 173–191
work page 2018
-
[10]
Sirunyan, Observation of t¯tH production, Physical Review Letters 120 (2018)
A.e.a. Sirunyan, Observation of t¯tH production, Physical Review Letters 120 (2018)
work page 2018
-
[11]
V˚ age,Performance and training bias of BDT vs NN in ttH(bb) search at ATLAS, Sept., 2018
L.H. V˚ age,Performance and training bias of BDT vs NN in ttH(bb) search at ATLAS, Sept., 2018
work page 2018
- [12]
- [13]
-
[14]
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
T. Dao, D.Y. Fu, S. Ermon, A. Rudra and C. R´ e, FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness, 2205.14135
work page internal anchor Pith review Pith/arXiv arXiv
-
[15]
D.E. Rumelhart, G.E. Hinton and R.J. Williams, Learning representations by back-propagating errors, Nature 323 (1986) 533
work page 1986
-
[16]
L. Builtjes, S. Caron, P. Moskvitina, C. Nellist, R.R. de Austri, R. Verheyen et al., Attention to the strengths of physical interactions: Transformer and graph-based event classification for particle physics experiments, SciPost Phys. 19 (2025) 028 [ 2211.05143]. – 10 –
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.