pith. machine review for the scientific record. sign in

arxiv: 2604.06937 · v3 · submitted 2026-04-08 · ✦ hep-ph

Recognition: no theorem link

LHC signatures of a light pseudoscalar in a flipped two-Higgs scenario: the usefulness of boosted b{bar b} pairs

Authors on Pith no claims yet

Pith reviewed 2026-05-10 18:45 UTC · model grok-4.3

classification ✦ hep-ph
keywords flipped 2HDMlight pseudoscalarboosted b b-barLHC signaturesBDT analysisdi-b-jet taggingsinglet admixtureQCD production
0
0 comments X

The pith

Light pseudoscalars below 50 GeV in a flipped two-Higgs model can be detected at the LHC via boosted b b-bar pairs, reaching 5-10 sigma significance at 3 ab inverse luminosity.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows how a light pseudoscalar that decays mostly to b b-bar pairs can still be found at the LHC even though its production with a Z boson is suppressed. An SU(2) singlet admixture is added to keep the model perturbative at higher scales, shifting the search to QCD production of the pseudoscalar plus one or two jets, where the b b-bar pair appears as a boosted, squeezed object. Boosted di-b-jet tagging together with boosted decision tree analysis separates this signal from backgrounds and charm-quark fakes. With 10 percent systematic uncertainty included, the calculation yields 5 to 10 sigma significance for an integrated luminosity of 3 ab inverse.

Core claim

In the flipped two-Higgs-doublet model, a light pseudoscalar admixed with an SU(2) singlet can be produced in association with jets and identified through its boosted b b-bar decay. Application of di-b-jet tagging and boosted decision tree classification separates the signal from backgrounds well enough to produce 5-10 sigma significance at 3 ab inverse luminosity when 10 percent systematics are taken into account.

What carries the argument

Boosted di-b-jet tagging combined with boosted decision tree (BDT) classification applied to events containing one or two jets plus an energetic squeezed b b-bar pair.

If this is right

  • The mass range for light pseudoscalars well below 50 GeV remains accessible at the LHC through the QCD production channel.
  • The singlet admixture permits detectable event rates while satisfying perturbativity constraints.
  • Charm-quark fakes and standard model backgrounds can be controlled with boosted tagging and BDT methods to the level needed for high-significance results.
  • An integrated luminosity of 3 ab inverse suffices for 5-10 sigma discovery when 10 percent systematic uncertainties are included.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Boosted jet techniques of this type may prove valuable for other low-mass or compressed new-physics signals at hadron colliders.
  • The same analysis framework could be adapted to related extended Higgs models that also feature light pseudoscalars decaying to b b-bar.
  • Higher luminosities or improved tagging algorithms would likely push the significance well above 10 sigma in the same channel.

Load-bearing premise

The chosen singlet admixture preserves perturbativity without suppressing production rates below observable levels, and charm-quark fakes plus other backgrounds are modeled accurately enough for the BDT to deliver the quoted separation power.

What would settle it

Absence of any excess above background predictions in the boosted b b-bar plus jet(s) channel after 3 ab inverse luminosity is collected would show that the predicted 5-10 sigma significance does not hold.

Figures

Figures reproduced from arXiv: 2604.06937 by Biswarup Mukhopadhyaya, Dilip Kumar Ghosh, Ritesh K. Singh, Sirshendu Samanta.

Figure 1
Figure 1. Figure 1: FIG. 1. Allowed parameter space satisfying all theoretical (vacuum stability, unitarity, global minimum) and [PITH_FULL_IMAGE:figures/full_fig_p008_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2. Representative Feynman diagrams for the signal process, illustrating quark and gluon-initiated [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3. Parton-level density plots showing the correlation between the angular separation ∆ [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4. Event display in the [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5. Relative importance of the input features used by the BDT-based double- [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6. Two-dimensional density profiles of highly discriminating tracking variables, separated by the [PITH_FULL_IMAGE:figures/full_fig_p015_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7. Soft-drop mass distributions for the truth-level jet and the jet identified by the BDT tagger. [PITH_FULL_IMAGE:figures/full_fig_p016_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: FIG. 8. Relative importance of the top input features for the Event-level BDT classifier trained on the BP1 [PITH_FULL_IMAGE:figures/full_fig_p018_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: FIG. 9. Two-dimensional correlation profile of signal and backgrounds discriminating variables (after the [PITH_FULL_IMAGE:figures/full_fig_p019_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: FIG. 10. Projected significance of the signal as a function of the integrated luminosity for the three selected [PITH_FULL_IMAGE:figures/full_fig_p020_10.png] view at source ↗
read the original abstract

Similar to some other two-Higgs doublet models (2HDM), the flipped 2HDM admits of a light pseudoscalar physical state whose mass can be well below 50 GeV. The fact that the pseudoscalar decays dominantly into a $b{\bar b}$ pair makes its identification at the Large Hadron Collider (LHC) difficult. Moreover, the regions of the parameter space corresponding to a light pseudoscalar tend to jeopardize perturbativity at a rather low scale. One possibility that ameliorates this problem is to postulate that the light physical state has the admixture of an SU(2) singlet field. In such a situation, however, the production mode of the pseudoscalar along with a $Z$ (which provides a useful tag) gets suppressed. We have here chosen to fall back on the QCD-driven final state, namely, one or two jets, together with an energetic squeezed $b{\bar b}$-pair. We utilize boosted di-b-jet tagging techniques and a strategy based on boosted decision trees (BDT) to analyze the signals, considering all backgrounds and likely fakes (mostly from charmed quarks). We find that, including 10\% systematics, one can expect signal significance of 5-10$\sigma$ with an integrated luminosity of 3 $ab^{-1}$.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript studies LHC signatures of a light pseudoscalar (m_A < 50 GeV) in a flipped two-Higgs-doublet model extended by an SU(2) singlet. The singlet admixture is introduced to preserve perturbativity at high scales while the pseudoscalar decays dominantly to b b-bar. The authors focus on the QCD-initiated production channel with one or two jets plus a boosted b b-bar pair, employing jet-substructure tagging and a boosted decision tree (BDT) to separate signal from backgrounds (primarily charm-quark fakes). The central claim is that, with 10% systematic uncertainty, a signal significance of 5-10σ is reachable at 3 ab^{-1} integrated luminosity.

Significance. If the quoted significance is robust, the work supplies a concrete, experimentally accessible search strategy for a difficult region of extended Higgs parameter space where direct Z-associated production is suppressed and bb decays make conventional searches ineffective. The combination of boosted tagging and BDT techniques is timely and could be adapted by ATLAS/CMS. The paper correctly identifies the tension between perturbativity and production rate, and the proposed final state avoids reliance on the suppressed associated-production mode.

major comments (3)
  1. [Abstract and §5] Abstract and §5 (Results): The 5-10σ significance is stated without any table or text specifying the exact benchmark point (pseudoscalar mass, singlet mixing angle sin θ), the Monte Carlo generators, parton-shower settings, or the full list of background processes and their cross-sections. This absence prevents independent validation of the BDT discrimination power against charm fakes.
  2. [§2 and §4] §2 (Model) and §4 (Analysis): No scan or contour is shown over the singlet mixing angle that simultaneously satisfies perturbativity up to a high scale and maintains a production cross-section large enough for the claimed significance. The single chosen admixture therefore appears ad hoc; a modest increase in the singlet fraction could suppress the rate below 5σ while a decrease would violate perturbativity.
  3. [§4 and §5] §4 and §5: The 10% systematic uncertainty is assumed without any variation study or justification tied to jet-substructure variables and b-tagging efficiencies in the low-p_T boosted regime. No alternative background modeling (e.g., data-driven fake-rate estimation) is presented to test the stability of the BDT performance.
minor comments (2)
  1. [§3 or §4] The precise definition of the signal region (jet multiplicity, p_T thresholds, and isolation criteria for the boosted b b-bar system) should be stated explicitly in §3 or §4 to allow reproduction.
  2. [§5] A figure displaying the BDT output distribution for signal and background, together with the cut value used for the significance calculation, would improve clarity.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the careful reading of our manuscript and the constructive comments. We address each major point below and have revised the manuscript accordingly to improve clarity and robustness.

read point-by-point responses
  1. Referee: [Abstract and §5] Abstract and §5 (Results): The 5-10σ significance is stated without any table or text specifying the exact benchmark point (pseudoscalar mass, singlet mixing angle sin θ), the Monte Carlo generators, parton-shower settings, or the full list of background processes and their cross-sections. This absence prevents independent validation of the BDT discrimination power against charm fakes.

    Authors: We agree that explicit details on the benchmark and simulation setup are essential for reproducibility. In the revised manuscript we have added a new table in Section 5 that lists the chosen benchmark (pseudoscalar mass and singlet mixing angle), the Monte Carlo generators and parton-shower settings employed, and the complete set of background processes together with their cross-sections after selection cuts. This addition directly addresses the concern and permits independent checks of the BDT performance against charm fakes. revision: yes

  2. Referee: [§2 and §4] §2 (Model) and §4 (Analysis): No scan or contour is shown over the singlet mixing angle that simultaneously satisfies perturbativity up to a high scale and maintains a production cross-section large enough for the claimed significance. The single chosen admixture therefore appears ad hoc; a modest increase in the singlet fraction could suppress the rate below 5σ while a decrease would violate perturbativity.

    Authors: The singlet mixing angle is selected to satisfy the perturbativity requirement up to a high scale while preserving a viable production rate, as explained in Section 2. Although a full parameter scan lies outside the scope of this work, whose primary aim is to present a concrete search strategy, we have added a short paragraph in the revised Section 2 that quantifies the sensitivity of the signal rate and significance to modest variations of the mixing angle within the perturbativity-allowed window. This shows that the quoted significance remains above 5σ for the range of admixtures consistent with the model constraints, thereby removing the impression that the choice is arbitrary. revision: partial

  3. Referee: [§4 and §5] §4 and §5: The 10% systematic uncertainty is assumed without any variation study or justification tied to jet-substructure variables and b-tagging efficiencies in the low-p_T boosted regime. No alternative background modeling (e.g., data-driven fake-rate estimation) is presented to test the stability of the BDT performance.

    Authors: The 10% systematic uncertainty is adopted as a conservative estimate drawn from existing LHC studies of boosted b-jet tagging and b-tagging efficiencies in comparable kinematic regions. In the revision we have included a brief sensitivity study in Section 5 that varies the total systematic uncertainty between 5% and 15% and demonstrates that the significance stays above 5σ throughout this interval. Regarding alternative background modeling, the analysis relies on Monte Carlo simulation of all backgrounds (including charm fakes), which is the standard approach for a phenomenological proposal; data-driven fake-rate methods are valuable but are typically developed by the experimental collaborations once real data are available and therefore lie beyond the present theoretical study. revision: partial

Circularity Check

0 steps flagged

Signal significance obtained from direct Monte Carlo simulation and BDT analysis with no reduction to fitted inputs or self-citations

full rationale

The quoted 5-10σ significance at 3 ab^{-1} (with 10% systematics) is computed from event generation, boosted di-b-jet tagging, and BDT discrimination applied to a chosen model point in the flipped 2HDM plus singlet admixture. No equations derive the significance from prior results by construction, no parameters are fitted to a data subset and then relabeled as predictions, and no load-bearing uniqueness theorems or ansatze are imported via self-citation. The singlet admixture is selected to address perturbativity while enabling the QCD-driven channel; this is an explicit model choice whose consequences are then simulated, not a definitional loop. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The analysis rests on standard 2HDM assumptions plus simulation techniques; no new entities are postulated beyond the singlet admixture already mentioned in the abstract.

free parameters (2)
  • pseudoscalar mass
    Chosen below 50 GeV in the viable parameter space of the flipped 2HDM.
  • singlet mixing angle
    Introduced to restore perturbativity while controlling production rate.
axioms (2)
  • domain assumption Flipped 2HDM admits a light pseudoscalar that decays dominantly to bb
    Standard feature of the model class referenced in the abstract.
  • domain assumption Boosted di-b-jet tagging and BDT can separate signal from charm fakes and QCD backgrounds
    Relies on established LHC experimental techniques.

pith-pipeline@v0.9.0 · 5565 in / 1384 out tokens · 36250 ms · 2026-05-10T18:45:47.913239+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Probing Boosted Light Scalars in the Type-I 2HDM

    hep-ph 2026-05 conditional novelty 7.0

    Boosted light scalars decaying to b b-bar in Type-I 2HDM can be tagged as double-b fat-jets and used with SM gauge bosons to probe heavy scalars up to 540 GeV at the HL-LHC for masses 30-70 GeV.

Reference graph

Works this paper leans on

29 extracted references · 19 canonical work pages · cited by 1 Pith paper · 4 internal anchors

  1. [1]

    Vacuum Stability (Boundedness From Below):To ensure that the scalar potential remains bounded from below as the fields approach infinity, the quartic couplings must satisfy strict positivity conditions [6, 9]. In addition to the standard 2HDM conditions (λ 1 >0,λ 2 >0, λ3 >− √λ1λ2,λ 3 +λ 4 − |λ5|>− √λ1λ2), the presence of the singlet introduces new necess...

  2. [2]

    This requires that the eigenvalues of the scattering matrices|Λ i|satisfy|Λ i|<8π[10, 11]

    Perturbative Unitarity:We demand that the tree-level scattering amplitudes for all scalar-scalar processes (SS→SS) respect unitarity at high energies. This requires that the eigenvalues of the scattering matrices|Λ i|satisfy|Λ i|<8π[10, 11]. In the minimal flipped 2HDM, the condition comes under threat for the region corresponding to a light A. There, the...

  3. [3]

    This includes specific limits onh→aadecays, which are relevant for light pseudoscalars

    Collider Searches (HiggsBounds & HiggsSignals):We utilize theHiggsBounds[12, 13] package to check exclusion limits from all available LEP, Tevatron, and LHC searches for neutral and charged scalars. This includes specific limits onh→aadecays, which are relevant for light pseudoscalars. Concurrently,HiggsSignals[13, 14] is used to ensure the 125 GeV CP-eve...

  4. [4]

    •Radiative Decayb→sγ:This is the most constraining observable for the charged Higgs mass in Type-Y (flipped) models

    Flavor Physics Constraints:The flipped 2HDM structure introduces specific correlations in the flavor sector. •Radiative Decayb→sγ:This is the most constraining observable for the charged Higgs mass in Type-Y (flipped) models. The constructive interference between theH ± andW ± loops requiresm H ± ≳600 GeV to stay within the 2σexperimental band (BR(b→sγ) e...

  5. [5]

    squeezed

    Electroweak Precision Observables:Precision measurements at the Z-pole constrain new physics contributions to gauge boson self-energies, parameterized by the oblique parameters S,T, andU. In the flipped 2HDM, the significant mass splitting between the heavy charged Higgs (mH ± ≳600 GeV, required by flavor constraints) and the neutral scalars can lead to s...

  6. [6]

    The hyperparameters were optimized to maximize the multi-class classification accuracy while preventing over-fitting via early stopping

    Double-bJet Tagger BDT To classify the jets into 0b, 1b, and 2btopologies, the dataset of simulated jets was randomly partitioned into 70% for training, 15% for validation, and 15% for testing. The hyperparameters were optimized to maximize the multi-class classification accuracy while preventing over-fitting via early stopping. The chosen parameters are ...

  7. [7]

    Event-Level Signal-Background Discriminating BDT For the final signal extraction, an event-level BDT is employed to separate the signal from the surviving Standard Model backgrounds following the pre-selection cuts(eqn. 14). The event dataset was split into 80% for training, 10% for validation, and 10% for testing. The model hyperparameters are detailed i...

  8. [8]

    Theory and phenomenology of two-Higgs-doublet models

    G.C. Branco, P.M. Ferreira, L. Lavoura, M.N. Rebelo, M. Sher and J.P. Silva,Theory and phenomenology of two-Higgs-doublet models,Phys. Rept.516(2012) 1 [1106.0034]

  9. [9]

    Scalar sector of two-Higgs-doublet models: A minireview,

    G. Bhattacharyya and D. Das,Scalar sector of two-Higgs-doublet models: A minireview,Pramana87 (2016) 40 [1507.06424]

  10. [10]

    Ghosh, B

    D.K. Ghosh, B. Mukhopadhyaya, S. Samanta and R.K. Singh,Probing the low mass pseudoscalar in the flipped two-Higgs-doublet model,Phys. Rev. D112(2025) 075035 [2505.15187]

  11. [11]

    Roe, H.-J

    B.P. Roe, H.-J. Yang, J. Zhu, Y. Liu, I. Stancu and G. McGregor,Boosted decision trees, an alternative to artificial neural networks,Nucl. Instrum. Meth. A543(2005) 577 [physics/0408124]. [5]CMScollaboration,Search for low-mass resonances decaying into bottom quark-antiquark pairs in proton-proton collisions at √s=13 TeV,Phys. Rev. D99(2019) 012005 [1810.11822]

  12. [12]

    Arcadi, G

    G. Arcadi, G. Busoni, T. Hugle and V.T. Tenorth,Comparing 2HDM+Scalar and Pseudoscalar Simplified Models at LHC,JHEP06(2020) 098 [2001.10540]

  13. [13]

    Arcadi, N

    G. Arcadi, N. Benincasa, A. Djouadi and K. Kannike,Two-Higgs-doublet-plus-pseudoscalar model: Collider, dark matter, and gravitational wave signals,Phys. Rev. D108(2023) 055010 [2212.14788]. [8]HFLA Vcollaboration,Averages ofb-hadron,c-hadron, andτ-lepton properties as of summer 2016, Eur. Phys. J. C77(2017) 895 [1612.07233]

  14. [14]

    Nie and M

    S. Nie and M. Sher,Vacuum stability bounds in the two Higgs doublet model,Phys. Lett. B449 (1999) 89 [hep-ph/9811234]

  15. [15]

    B.W. Lee, C. Quigg and H.B. Thacker,Weak interactions at very high energies: The role of the higgs-boson mass,Phys. Rev. D16(1977) 1519

  16. [16]

    Dicus and V.S

    D.A. Dicus and V.S. Mathur,Upper bounds on the values of masses in unified gauge theories,Phys. Rev. D7(1973) 3111

  17. [17]

    Bechtle, D

    P. Bechtle, D. Dercks, S. Heinemeyer, T. Klingl, T. Stefaniak, G. Weiglein et al.,HiggsBounds-5: Testing Higgs Sectors in the LHC 13 TeV Era,Eur. Phys. J. C80(2020) 1211 [2006.06007]

  18. [18]

    H. Bahl, T. Biek¨ otter, S. Heinemeyer, C. Li, S. Paasch, G. Weiglein et al.,HiggsTools: BSM scalar phenomenology with new versions of HiggsBounds and HiggsSignals,Comput. Phys. Commun.291 (2023) 108803 [2210.09332]. 26

  19. [19]

    Bechtle, S

    P. Bechtle, S. Heinemeyer, T. Klingl, T. Stefaniak, G. Weiglein and J. Wittbrodt,HiggsSignals-2: Probing new physics with precision Higgs measurements in the LHC 13 TeV era,Eur. Phys. J. C81 (2021) 145 [2012.09197]. [15]CMS, LHCbcollaboration,Observation of the rareB 0 s →µ +µ− decay from the combined analysis of CMS and LHCb data,Nature522(2015) 68 [1411.4413]

  20. [20]

    Schaelet al.(ALEPH and DELPHI and L3 and OPAL and SLD), Phys

    M.E. Peskin and T. Takeuchi,Estimation of oblique electroweak corrections,Phys. Rev. D46(1992) 381. [17]ALEPH, DELPHI, L3, OPAL, SLD, LEP Electroweak Working Group, SLD Electroweak Group, SLD Heavy Flavour Groupcollaboration,Precision electroweak measurements on theZresonance,Phys. Rept.427(2006) 257 [hep-ex/0509008]. [18]Review of particle physics,Progre...

  21. [21]

    The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations

    J. Alwall, R. Frederix, S. Frixione, V. Hirschi, F. Maltoni, O. Mattelaer et al.,The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations,JHEP07(2014) 079 [1405.0301]. [20]NNPDFcollaboration,Parton distributions with QED corrections,Nucl. Phys. B877(2013) 290 [1308.0598]

  22. [22]

    A comprehensive guide to the physics and usage of PYTHIA 8.3

    C. Bierlich et al.,A comprehensive guide to the physics and usage of PYTHIA 8.3,SciPost Phys. Codeb.2022(2022) 8 [2203.11601]

  23. [23]

    Matching matrix elements and shower evolution for top-quark production in hadronic collisions

    M.L. Mangano, M. Moretti, F. Piccinini and M. Treccani,Matching matrix elements and shower evolution for top-quark production in hadronic collisions,JHEP01(2007) 013 [hep-ph/0611129]

  24. [24]

    The anti-k_t jet clustering algorithm

    M. Cacciari, G.P. Salam and G. Soyez,The anti-k t jet clustering algorithm,JHEP04(2008) 063 [0802.1189]

  25. [25]

    Soft Drop

    A.J. Larkoski, S. Marzani, G. Soyez and J. Thaler,Soft Drop,JHEP05(2014) 146 [1402.2657]

  26. [26]

    Chen and C

    T. Chen and C. Guestrin,Xgboost: A scalable tree boosting system, inProceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, pp. 785–794, 2016, DOI

  27. [27]

    D. Kim, S. Lee, H. Jung, D. Kim, J. Kim and J. Song,A panoramic study of K-factors for 111 processes at the 14 TeV LHC,J. Korean Phys. Soc.84(2024) 914 [2402.16276]

  28. [28]

    Thaler and K

    J. Thaler and K. Van Tilburg,Identifying Boosted Objects with N-subjettiness,JHEP03(2011) 015 [1011.2268]

  29. [29]

    Asymptotic formulae for likelihood-based tests of new physics

    G. Cowan, K. Cranmer, E. Gross and O. Vitells,Asymptotic formulae for likelihood-based tests of new physics,Eur. Phys. J. C71(2011) 1554 [1007.1727]