pith. machine review for the scientific record. sign in

arxiv: 2604.11574 · v1 · submitted 2026-04-13 · ✦ hep-ex · hep-ph· nucl-ex

Recognition: unknown

SemiCharmTag: a tool for Semileptonic Charm tagging

Authors on Pith no claims yet

Pith reviewed 2026-05-10 16:01 UTC · model grok-4.3

classification ✦ hep-ex hep-phnucl-ex
keywords semileptonic charm decaysDrell-Yan measurementsbackground rejectionLHCb experimentsecondary vertex taggingdimuon analysischarm tagging
0
0 comments X

The pith

A secondary vertex tagging method using hadron tracks improves charm background rejection in Drell-Yan dimuon measurements by a factor of about four.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a technique to tag or reject muons from semileptonic charm decays by identifying a hadron track from the same secondary vertex. Developed using full LHCb simulations for proton-proton collisions at 13.6 TeV, it targets the dimuon invariant mass range of 2.9 to 5 GeV/c² with muon pT above 1 GeV/c. This approach achieves a fourfold improvement in signal-to-background ratio at 81% efficiency while keeping bias on the Drell-Yan signal small. It also provides a way to create pure samples of charm-decay muons with low Drell-Yan contamination. Such tools matter for precision measurements of Drell-Yan processes that probe fundamental interactions at low masses.

Core claim

The authors present a novel strategy for background rejection in dimuon Drell-Yan measurements by tagging secondary vertices with hadron tracks, achieving a signal over background improvement of a factor ∼4 at an efficiency of 81% with minimal bias on the Drell-Yan signal properties. They also describe a second approach for constructing unbiased background-pure samples of single muons from charm decays with a charm efficiency of 21.4% at a Drell-Yan efficiency of 1.1%.

What carries the argument

The SemiCharmTag method, which associates a lepton candidate with a hadron track sharing a reconstructed secondary vertex to identify charm semileptonic decays.

If this is right

  • Enhanced purity of Drell-Yan samples in the specified mass and momentum range leads to more accurate cross-section measurements.
  • The method allows construction of control samples for charm backgrounds with minimal signal contamination.
  • Minimal bias on signal properties preserves the integrity of kinematic distributions for physics analysis.
  • The reported efficiencies provide a quantitative benchmark for background control in similar dimuon analyses.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Application of this tagging in actual collision data could reveal discrepancies between simulation and reality that affect background estimates.
  • The technique might extend to other heavy-flavor background rejections in lepton-based searches at hadron colliders.
  • Improved background control could enable tighter constraints on parton distribution functions from Drell-Yan data.

Load-bearing premise

The full Monte Carlo simulations accurately reproduce the real LHCb detector response, tracking, vertex resolution, and decay kinematics for both charm semileptonic decays and Drell-Yan processes.

What would settle it

Measuring the actual signal-to-background improvement and tagging efficiencies in real LHCb proton-proton collision data and comparing them directly to the simulation predictions would test the validity of the reported performance.

Figures

Figures reproduced from arXiv: 2604.11574 by Alisha Lightbody, Carolina Arata, Imanol Corredoira, Michael Winn.

Figure 1
Figure 1. Figure 1: Top: dimuon IP distributions for dimuons coming from [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Distributions of the different discriminant variables listed in Sec. [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Left: Signal probability score for signal (Drell-Yan, blue) and background (charm, red [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Illustration of the double-tag strategy. The threshold is shown in red. If at least one [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Performance of the tool for signal efficiency as a function of charm (red) and beauty [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Normalized pT (left panel) and rapidity (right panel) distributions for Drell-Yan dimuon candidates before applying the double-tagging strategy (black) and after selection (red), as well as the ratio between both distributions. distributions are of similar size, with the exception of the acceptance edges. The deviations between y∈ [2, 2.3] and y∈ [4.4, 5] can reach up to 25%. The effect of the tagging tech… view at source ↗
Figure 7
Figure 7. Figure 7: Top: Dimuon mass distribution for Drell-Yan (blue), charm (red) and beauty (green) [PITH_FULL_IMAGE:figures/full_fig_p015_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Illustration of the single-tag strategy. A tagging muon below a defined threshold, as [PITH_FULL_IMAGE:figures/full_fig_p016_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Ratio of normalized IP distributions before and after applying the single-tag strategy, [PITH_FULL_IMAGE:figures/full_fig_p019_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: IP distribution of the probe muon in the inclusive sample (orange), before any [PITH_FULL_IMAGE:figures/full_fig_p020_10.png] view at source ↗
read the original abstract

A method for selecting and/or rejecting leptons from charm semileptonic decays based on the tagging of the secondary vertex using a hadron track is introduced. The method is developed for dimuon Drell-Yan measurements in LHCb using full simulations in proton-proton collisions at $\sqrt{s}=13.6$ TeV. We focus on the invariant mass range between 2.9 and 5 GeV/$c^2$ with single muon transverse momentum larger than 1 GeV/$c$. A novel strategy is detailed for background rejection, achieving an improvement of the signal over background of a factor $\sim 4$ at an efficiency of 81% with minimal bias on the Drell-Yan signal properties. Moreover, a second approach is presented for the construction of unbiased background-pure samples of single muons from charm decays, achieving a charm efficiency of 21.4% at a Drell-Yan efficiency of 1.1%.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript introduces SemiCharmTag, a method for tagging leptons from charm semileptonic decays via secondary-vertex information from an associated hadron track. Developed using full Monte Carlo simulations of 13.6 TeV pp collisions for dimuon Drell-Yan analyses in the 2.9–5 GeV/c² mass window with muon pT > 1 GeV/c, it presents a background-rejection strategy that achieves a signal-to-background improvement of approximately 4 at 81% efficiency with minimal bias on Drell-Yan kinematics, plus a second approach yielding 21.4% charm efficiency at 1.1% Drell-Yan efficiency for constructing pure background samples.

Significance. If the Monte Carlo modeling of vertex resolution, tracking, and kinematics proves accurate for real LHCb data, the tool could meaningfully reduce charm semileptonic backgrounds in Drell-Yan measurements, aiding precision electroweak studies. The factor-of-4 S/B gain at high efficiency is a concrete, potentially useful advance for LHCb analyses, though its practical impact hinges on experimental validation.

major comments (3)
  1. [Abstract] Abstract: The headline performance numbers (factor ∼4 S/B improvement at 81% efficiency; 21.4% charm efficiency at 1.1% Drell-Yan efficiency) are stated without any description of the tagging algorithm, cut definitions, or how systematic uncertainties on these quantities are evaluated.
  2. [Performance studies] Performance studies: All quoted efficiencies and background-rejection factors are extracted exclusively from full MC samples; no data-MC comparisons are presented for the critical observables (secondary-vertex resolution, hadron-track reconstruction efficiency, or decay kinematics) in the stated 2.9–5 GeV/c² window.
  3. [Methodology] Methodology: The central claims rest on the assumption that the MC faithfully reproduces real LHCb detector response for both charm semileptonic decays and Drell-Yan processes; this assumption is load-bearing for applicability to data but is not tested or quantified in the manuscript.
minor comments (2)
  1. The manuscript would benefit from explicit pseudocode or a flow chart for the tagging algorithm to improve reproducibility.
  2. Notation for the two efficiency definitions (charm vs. Drell-Yan) should be standardized across text and any tables.

Simulated Author's Rebuttal

3 responses · 1 unresolved

We thank the referee for the careful and constructive review of our manuscript. We have revised the paper to address the points raised, expanding the abstract, methodology, and performance sections with additional context and caveats while maintaining the focus on the simulation-based tool development.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The headline performance numbers (factor ∼4 S/B improvement at 81% efficiency; 21.4% charm efficiency at 1.1% Drell-Yan efficiency) are stated without any description of the tagging algorithm, cut definitions, or how systematic uncertainties on these quantities are evaluated.

    Authors: We agree that the abstract would benefit from more context. The revised abstract now includes a concise description of the secondary-vertex tagging method using an associated hadron track, along with the main selection criteria (muon pT > 1 GeV/c and dimuon mass 2.9–5 GeV/c²). We have also clarified that the quoted efficiencies and S/B improvement are statistical only from the MC samples, with a note that full systematic uncertainties would require data validation. revision: yes

  2. Referee: [Performance studies] Performance studies: All quoted efficiencies and background-rejection factors are extracted exclusively from full MC samples; no data-MC comparisons are presented for the critical observables (secondary-vertex resolution, hadron-track reconstruction efficiency, or decay kinematics) in the stated 2.9–5 GeV/c² window.

    Authors: The manuscript is a simulation study introducing the tagging tool. We have added a dedicated paragraph in the performance studies section discussing expected data-MC agreement based on published LHCb tracking and vertexing performance, and we explicitly state that direct comparisons for these observables in the relevant mass window are not included here. Such comparisons would require dedicated data samples and analysis, which lies outside the scope of this tool-description paper. revision: partial

  3. Referee: [Methodology] Methodology: The central claims rest on the assumption that the MC faithfully reproduces real LHCb detector response for both charm semileptonic decays and Drell-Yan processes; this assumption is load-bearing for applicability to data but is not tested or quantified in the manuscript.

    Authors: We have expanded the methodology section to detail the simulation chain (generators, detector response, and reconstruction settings) and to explicitly discuss the assumption of MC fidelity. A new paragraph quantifies known LHCb performance metrics for vertex resolution and tracking efficiency and notes potential residual discrepancies; we recommend that users perform data-driven validation when applying the tool to real data. revision: yes

standing simulated objections not resolved
  • The absence of actual data-MC comparison plots for secondary-vertex resolution, tracking efficiency, and kinematics, which cannot be generated within the current simulation-only scope of the manuscript.

Circularity Check

0 steps flagged

No significant circularity; performance metrics are direct outputs from Monte Carlo event counts.

full rationale

The paper presents a tagging algorithm based on secondary-vertex information from a hadron track, with all quoted efficiencies and background-rejection factors obtained by applying the selection to independent full Monte Carlo samples of 13.6 TeV pp collisions. No parameter is fitted to the target signal or background yields and then re-used as a prediction; no self-citation supplies a load-bearing uniqueness theorem or ansatz; and the derivation consists solely of defining the tagging criteria and tabulating their performance on simulated events. This structure is self-contained against external benchmarks and contains no reduction of any claimed result to its own inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central performance claims rest on the accuracy of the full detector simulations in modeling charm semileptonic decays, Drell-Yan production, and LHCb tracking/vertexing in 13.6 TeV pp collisions; no free parameters or new entities are introduced in the abstract.

axioms (1)
  • domain assumption Full Monte Carlo simulations faithfully reproduce detector response, tracking efficiency, vertex resolution, and decay kinematics for the relevant processes.
    All quoted efficiencies and background-rejection factors are derived from these simulations.

pith-pipeline@v0.9.0 · 5476 in / 1426 out tokens · 73201 ms · 2026-05-10T16:01:39.474301+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

45 extracted references · 33 canonical work pages · 3 internal anchors

  1. [1]

    J. J. Ethier and E. R. Nocera, Parton Distributions in Nucleons and Nuclei , Ann. Rev. Nucl. Part. Sci. 70 (2020) 43, arXiv:2001.07722

  2. [2]

    Klasen and H

    M. Klasen and H. Paukkunen, Nuclear PDFs After the First Decade of LHC Data , Ann. Rev. Nucl. Part. Sci. 74 (2024) 49, arXiv:2311.00450

  3. [3]

    TMD Handbook,

    R. Boussarie et al., TMD Handbook, arXiv:2304.03302

  4. [4]

    Salabura and J

    P. Salabura and J. Stroth, Dilepton radiation from strongly interacting systems , Prog. Part. Nucl. Phys. 120 (2021) 103869, arXiv:2005.14589

  5. [5]

    Coquet et al., Intermediate mass dileptons as pre-equilibrium probes in heavy ion collisions, Phys

    M. Coquet et al., Intermediate mass dileptons as pre-equilibrium probes in heavy ion collisions, Phys. Lett. B 821 (2021) 136626, arXiv:2104.07622

  6. [6]

    Bailhache and H

    R. Bailhache and H. Appelsh¨ auser, Dileptons at Colliders as Probes of the Quark–Gluon Plasma, Ann. Rev. Nucl. Part. Sci. 75 (2025) 463, arXiv:2512.10597

  7. [7]

    The Color Glass Condensate

    F. Gelis, E. Iancu, J. Jalilian-Marian, and R. Venugopalan, The Color Glass Conden- sate, Ann. Rev. Nucl. Part. Sci. 60 (2010) 463, arXiv:1002.0333

  8. [8]

    LHCb collaboration, LHCb VELO Upgrade Technical Design Report , CERN-LHCC- 2013-021, 2013

  9. [9]

    Arnaldi et al., NA60 results on thermal dimuons , Eur

    NA60 collaboration, R. Arnaldi et al., NA60 results on thermal dimuons , Eur. Phys. J. C 61 (2009) 711, arXiv:0812.3053

  10. [10]

    CMS collaboration, A. M. Sirunyan et al., Study of Drell-Yan dimuon production in proton-lead collisions at √sNN = 8.16 TeV, JHEP 05 (2021) 182, arXiv:2102.13648

  11. [11]

    Abe et al., Measurement of the average lifetime of B hadrons produced in p¯p collisions at √s = 1.8 TeV, Phys

    CDF collaboration, F. Abe et al., Measurement of the average lifetime of B hadrons produced in p¯p collisions at √s = 1.8 TeV, Phys. Rev. Lett. 71 (1993) 3421

  12. [12]

    Aad et al., Measurement of the low-mass Drell-Yan differ- ential cross section at √s = 7 TeV using the ATLAS detector , JHEP 06 (2014) 112, arXiv:1404.1212

    ATLAS collaboration, G. Aad et al., Measurement of the low-mass Drell-Yan differ- ential cross section at √s = 7 TeV using the ATLAS detector , JHEP 06 (2014) 112, arXiv:1404.1212

  13. [13]

    Acharya et al., Measurement of charged jet cross section in pp collisions at √s = 5.02 TeV, Phys

    ALICE collaboration, S. Acharya et al., Measurement of charged jet cross section in pp collisions at √s = 5.02 TeV, Phys. Rev. D 100 (2019) 092004, arXiv:1905.02536

  14. [14]

    Apolin´ ario, Y.-J

    L. Apolin´ ario, Y.-J. Lee, and M. Winn,Heavy quarks and jets as probes of the QGP , Prog. Part. Nucl. Phys. 127 (2022) 103990, arXiv:2203.16352. 19

  15. [15]

    Navas et al., Review of particle physics , Phys

    Particle Data Group, S. Navas et al., Review of particle physics , Phys. Rev. D110 (2024) 030001

  16. [16]

    Acharyaet al.(ALICE), Charm-quark fragmentation fractions and production cross section at midrapidity in pp collisions at the LHC, Phys

    ALICE collaboration, S. Acharya et al., Charm-quark fragmentation fractions and production cross section at midrapidity in pp collisions at the LHC , Phys. Rev. D 105 (2022) L011103, arXiv:2105.06335

  17. [17]

    Acharya et al

    ALICE collaboration, S. Acharya et al. , Charm production and fragmentation fractions at midrapidity in pp collisions at √s = 13 TeV , JHEP 12 (2023) 086, arXiv:2308.04877

  18. [18]

    Schaelet al.(ALEPH and DELPHI and L3 and OPAL and SLD), Phys

    ALEPH, DELPHI, L3, OPAL, SLD, LEP Electroweak Working Group, SLD Elec- troweak Group, SLD Heavy Flavour Group, S. Schaelet al., Precision electroweak mea- surements on the Z resonance, Phys. Rept. 427 (2006) 257, arXiv:hep-ex/0509008

  19. [19]

    Tuning PYTHIA 8.1: the Monash 2013 Tune

    P. Skands, S. Carrazza, and J. Rojo, Tuning PYTHIA 8.1: the Monash 2013 Tune , Eur. Phys. J. C 74 (2014) 3024, arXiv:1404.5630

  20. [20]

    An Introduction to PYTHIA 8.2

    T. Sj¨ ostrandet al., An introduction to PYTHIA 8.2 , Comput. Phys. Commun. 191 (2015) 159, arXiv:1410.3012

  21. [21]

    Herwig 7.0 / Herwig++ 3.0 Release Note

    J. Bellm et al., Herwig 7.0/Herwig++ 3.0 release note , Eur. Phys. J. C 76 (2016) 196, arXiv:1512.01178

  22. [22]

    Aaij et al

    LHCb collaboration, R. Aaij et al. , Prompt charm production in pp collisions at sqrt(s)=7 TeV, Nucl. Phys. B 871 (2013) 1, arXiv:1302.2864

  23. [23]

    Li, X.-R

    P.-R. Li, X.-R. Lyu, and Y. Zheng, Experimental overview on the charmed baryon decays, Chin. Phys. C 50 (2026) 022002, arXiv:2509.19141

  24. [24]

    Aaij et al

    LHCb collaboration, R. Aaij et al. , Measurement of the B0 s → µ+µ− branching fraction and effective lifetime and search for B0 → µ+µ− decays, Phys. Rev. Lett. 118 (2017) 191801, arXiv:1703.05747

  25. [25]

    Aaijet al.(LHCb), Phys

    LHCb collaboration, R. Aaij et al., Search for Dark Photons Produced in 13 TeV pp Collisions, Phys. Rev. Lett. 120 (2018) 061801, arXiv:1710.02867

  26. [26]

    A Brief Introduction to PYTHIA 8.1

    T. Sjostrand, S. Mrenna, and P. Z. Skands, A Brief Introduction to PYTHIA 8.1 , Comput. Phys. Commun. 178 (2008) 852, arXiv:0710.3820

  27. [27]

    Belyaev et al., Handling of the generation of primary events in Gauss, the LHCb simulation framework , CERN, Geneva, 2011

    I. Belyaev et al., Handling of the generation of primary events in Gauss, the LHCb simulation framework , CERN, Geneva, 2011. LHCb-PROC-2011-005, CERN-LHCb- PROC-2011-005

  28. [28]

    D. J. Lange, The EvtGen particle decay simulation package , Nucl. Instrum. Meth. A462 (2001) 152

  29. [29]

    Golonka and Z

    P. Golonka and Z. Was, PHOTOS Monte Carlo: A precision tool for QED corrections in Z and W decays, Eur. Phys. J. C45 (2006) 97, arXiv:hep-ph/0506026

  30. [30]

    Agostinelli et al., Geant4: A simulation toolkit, Nucl

    Geant4 collaboration, S. Agostinelli et al., Geant4: A simulation toolkit, Nucl. Instrum. Meth. A506 (2003) 250. 20

  31. [31]

    Allison et al., Geant4 developments and applications , IEEE Trans

    Geant4 collaboration, J. Allison et al., Geant4 developments and applications , IEEE Trans. Nucl. Sci. 53 (2006) 270

  32. [32]

    Clemencic et al., The LHCb simulation application, Gauss: Design, evolution and experience, J

    M. Clemencic et al., The LHCb simulation application, Gauss: Design, evolution and experience, J. Phys. Conf. Ser. 331 (2011) 032023

  33. [33]

    Barrand et al

    G. Barrand et al. , Gaudi — a software architecture and framework for building hep data processing applications, Computer Physics Communications 140 (2001) 45, CHEP2000

  34. [34]

    Clemencic et al., Recent developments in the LHCb software framework gaudi , Journal of Physics: Conference Series 219 (2010) 042006

    M. Clemencic et al., Recent developments in the LHCb software framework gaudi , Journal of Physics: Conference Series 219 (2010) 042006

  35. [35]

    J. Alwall et al., The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations , JHEP 07 (2014) 079, arXiv:1405.0301

  36. [36]

    Frederix, S

    R. Frederix et al., The automation of next-to-leading order electroweak calculations , JHEP 07 (2018) 185, arXiv:1804.10017, [Erratum: JHEP 11, 085 (2021)]

  37. [37]

    R. D. Ball et al., Parton distributions for the LHC run2 , Journal of High Energy Physics 2015 (2015)

  38. [38]

    Cacciari, S

    M. Cacciari et al., Theoretical predictions for charm and bottom production at the LHC, JHEP 10 (2012) 137, arXiv:1205.6344

  39. [39]

    Cacciari, M

    M. Cacciari, M. L. Mangano, and P. Nason, Gluon PDF constraints from the ratio of forward heavy-quark production at the LHC at √s = 7 and 13 TeV, Eur. Phys. J. C 75 (2015) 610, arXiv:1507.06197

  40. [40]

    LHCb collaboration, DaVinci application GitLab repository , https://gitlab.cern.ch/lhcb/DaVinci/

  41. [41]

    Anderlini et al., Muon identification for LHCb Run 3 , JINST 15 (2020) T12005, arXiv:2008.01579

    L. Anderlini et al., Muon identification for LHCb Run 3 , JINST 15 (2020) T12005, arXiv:2008.01579

  42. [42]

    Proceedings of the 22nd

    T. Chen and C. Guestrin, XGBoost: A Scalable Tree Boosting System , doi: 10.1145/2939672.2939785 arXiv:1603.02754

  43. [43]

    Aaij et al., Identification of charm jets at LHCb , JINST 17 (2022) P02028, arXiv:2112.08435

    LHCb collaboration, R. Aaij et al., Identification of charm jets at LHCb , JINST 17 (2022) P02028, arXiv:2112.08435

  44. [44]

    LHCb collaboration, R. Aaij et al., Measurements of prompt charm production cross- sections in pp collisions at √s = 13 TeV, JHEP 03 (2016) 159, arXiv:1510.01707, [Erratum: JHEP 09, 013 (2016), Erratum: JHEP 05, 074 (2017)]

  45. [45]

    Aaij et al., Measurement of the B± production cross-section in pp collisions at √s = 7 and 13 TeV , JHEP 12 (2017) 026, arXiv:1710.04921

    LHCb collaboration, R. Aaij et al., Measurement of the B± production cross-section in pp collisions at √s = 7 and 13 TeV , JHEP 12 (2017) 026, arXiv:1710.04921. 21