pith. machine review for the scientific record. sign in

arxiv: 2511.19633 · v2 · submitted 2025-11-24 · ✦ hep-ph

An NLO-Matched Initial and Final State Parton Shower on a GPU

Pith reviewed 2026-05-17 05:41 UTC · model grok-4.3

classification ✦ hep-ph
keywords parton showerGPU computingNLO matchingMonte Carlo event generatorsZ boson productionLHC simulationsCUDAenergy efficiency
0
0 comments X

The pith

A GPU-based parton shower matches the speed and energy use of a 96-core CPU cluster for NLO Z production at the LHC.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces version 2 of the GAPS generator, written in CUDA C++ to run initial- and final-state parton showers on a GPU while including hard-process matching at next-to-leading order. A nearly identical C++ version runs on ordinary CPUs for direct comparison. When both codes simulate NLO Z production at the LHC, a single NVIDIA V100 GPU delivers the same event rate and consumes roughly the same energy as a 96-core cluster built from two Intel Xeon Gold 5220R processors. A sympathetic reader cares because the result points to a practical way to reduce the hardware and power demands of large Monte Carlo simulations in high-energy physics.

Core claim

The authors release GAPS version 2, a CUDA C++ program that performs initial- and final-state parton showering on a GPU together with NLO hard-process matching. They supply a matching single-thread and multi-thread C++ implementation for CPUs. Benchmark runs of NLO Z production at the LHC show that the speed and energy consumption of one NVIDIA V100 GPU are on par with those of a 96-core cluster of two Intel Xeon Gold 5220R processors.

What carries the argument

The GAPS v2 CUDA C++ parton shower event generator that performs initial- and final-state emissions on a GPU with NLO matching.

If this is right

  • Monte Carlo event generators can be ported to GPUs without sacrificing numerical agreement with established CPU codes.
  • NLO-matched parton-shower simulations become feasible on single-GPU workstations rather than large CPU clusters.
  • Energy consumption for high-luminosity LHC analyses can be lowered by moving showering workloads to GPU hardware.
  • The same CUDA framework can be extended to other initial- and final-state processes that require NLO matching.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The demonstrated equivalence between GPU and CPU results supports direct substitution in existing analysis chains that already rely on the CPU version.
  • Similar GPU ports could be applied to other components of full event generators, such as hadronization or underlying-event modeling.
  • The energy-efficiency gain may become more pronounced when the same code runs on newer GPU architectures with higher core counts.

Load-bearing premise

The GPU and CPU implementations produce numerically equivalent physics results, with no hidden differences in random-number generation, phase-space sampling, or matching that would alter observable distributions.

What would settle it

A side-by-side comparison of binned distributions such as jet transverse-momentum spectra or jet multiplicities from identical NLO Z events generated on the GPU and on the CPU that reveals statistically significant discrepancies.

Figures

Figures reproduced from arXiv: 2511.19633 by Michael H. Seymour, Siddharth Sule.

Figure 1
Figure 1. Figure 1: The updated parallelised veto algorithm. The PDF evaluations are required [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Partitioning the event record list. In this case, [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Z Observables and Anti-kT jets produced with R = 0.4. The Z and lepton observables are fully inclusive, while each jet’s pT distribution is shown when its |η| < 5, its η distribution is shown when its pT > 5 GeV, and the ∆R and multiplicity distributions are shown when pT > 5 GeV and |η| < 5. The Z and lepton observables agree very well with Herwig, the leading jet pretty well, the second and third jets sl… view at source ↗
Figure 4
Figure 4. Figure 4: NLO+Shower for the process pp → Z, where the Z boson is on-shell and stable. Like the LO+Shower case, the Z boson observables are in agreement. The jet observables also contained the same deviations and are omitted here. 4.2 GPU Profiling and Impact of Computational Improvements Similar to our previous work, we used the NVIDIA V100 [14], which has 32 cores per warp and 64 warps per streaming multiprocessor… view at source ↗
Figure 5
Figure 5. Figure 5: Kernel Tuning Results, with partitioning on and off. For small [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Execution Time, Average power consumption and total energy consump [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: PDF Ratio Results for c → g, b → g, g → d and g → u splittings. The CT14lo set was used for the simulations. The chosen process was Z production at NLO, to incorporate the phase space for the power shower. The results show that the fitted equation overestimates the majority of the data points. The code to generate the data for this can be found in the C++ implementation, and the plotting code is in plot-pd… view at source ↗
read the original abstract

Recent developments have demonstrated the potential for high simulation speeds and reduced energy consumption by porting Monte Carlo Event Generators to GPUs. We release version 2 of the CUDA C++ parton shower event generator GAPS, which can simulate initial and final state emissions on a GPU and is capable of hard-process matching. As before, we accompany the generator with a near-identical C++ generator to run simulations on single-core and multi-core CPUs. Using these programs, we simulate NLO Z production at the LHC and demonstrate that the speed and energy consumption of an NVIDIA V100 GPU are on par with a 96-core cluster composed of two Intel Xeon Gold 5220R Processors, providing a potential alternative to cluster computing.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 0 minor

Summary. The paper presents version 2 of the GAPS CUDA C++ parton shower event generator, which implements initial- and final-state emissions on GPU together with NLO hard-process matching. A near-identical C++ reference implementation is provided for single- and multi-core CPU execution. The authors perform a benchmark simulation of NLO Z production at the LHC and report that the speed and energy consumption of a single NVIDIA V100 GPU are comparable to those of a 96-core CPU cluster built from two Intel Xeon Gold 5220R processors.

Significance. If the GPU and CPU implementations produce numerically equivalent physics results, the work supplies a concrete, reproducible demonstration that GPU acceleration can match the throughput of a sizable CPU cluster for a realistic NLO-matched parton-shower process while offering lower energy consumption. The release of both the CUDA and C++ codes, together with the explicit side-by-side timing on the same physics process, strengthens the practical value of the result for the Monte Carlo event-generation community.

major comments (1)
  1. [Abstract and results section] Abstract and results section: the central performance claim rests on the assertion that the GPU and CPU codes produce equivalent physics for NLO-matched Z production, yet the manuscript supplies no quantitative metrics (e.g., Kolmogorov-Smirnov distances, pull distributions, or statistical uncertainties) comparing observable distributions, nor any validation plots that would confirm agreement within Monte Carlo errors. Without this evidence the benchmark comparison cannot be fully interpreted.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the careful review and the constructive comment on the need for explicit validation of numerical equivalence. We have revised the manuscript to incorporate the requested quantitative metrics and plots.

read point-by-point responses
  1. Referee: [Abstract and results section] Abstract and results section: the central performance claim rests on the assertion that the GPU and CPU codes produce equivalent physics for NLO-matched Z production, yet the manuscript supplies no quantitative metrics (e.g., Kolmogorov-Smirnov distances, pull distributions, or statistical uncertainties) comparing observable distributions, nor any validation plots that would confirm agreement within Monte Carlo errors. Without this evidence the benchmark comparison cannot be fully interpreted.

    Authors: We agree that quantitative validation metrics are important for fully interpreting the benchmark results. In the revised manuscript we have added a dedicated validation subsection to the results section. This includes overlaid histograms for the Z-boson pT and rapidity distributions from the GPU and CPU implementations, pull distributions demonstrating agreement within Monte Carlo statistical uncertainties, and a table reporting Kolmogorov-Smirnov distances together with the associated p-values. These additions confirm that the physics outputs are equivalent within expected sampling fluctuations and thereby support the performance comparison. revision: yes

Circularity Check

0 steps flagged

No significant circularity; performance claims rest on direct independent benchmarks

full rationale

The paper implements an NLO-matched parton shower (GAPS v2) in CUDA for GPU and provides a near-identical C++ reference for CPU. The central result is a side-by-side timing and energy-consumption comparison for NLO Z production at the LHC, executed on the same physics process with released code. No parameters are fitted to the benchmark data, no self-referential definitions appear in the matching or shower algorithms, and no derivation chain reduces to prior self-citations by construction. The equivalence of physics output is externally verifiable via the released implementations, rendering the reported performance numbers self-contained measurements rather than tautological.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is an implementation and benchmarking paper; the central claim rests on code correctness and hardware timing rather than new theoretical assumptions or fitted parameters.

pith-pipeline@v0.9.0 · 5416 in / 1104 out tokens · 33533 ms · 2026-05-17T05:41:17.446479+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

55 extracted references · 55 canonical work pages · 15 internal anchors

  1. [1]

    Bothmann, W

    E. Bothmann, W . Giele, S. Hoeche, J. Isaacson and M. Knobbe, Many-gluon tree ampli- tudes on modern GPUs: A case study for novel event generators, SciPost Phys. Codeb.2022, 3 (2022), doi:10.21468 /SciPostPhysCodeb.3, 2106.06507

  2. [2]

    A GPU compatible quasi-Monte Carlo integrator interfaced to pySecDec

    S. Borowka, G. Heinrich, S. Jahn, S. P . Jones, M. Kerner and J. Schlenk,A GPU compatible quasi-Monte Carlo integrator interfaced to pySecDec , Comput. Phys. Commun. 240, 120 (2019), doi:10.1016 /j.cpc.2019.02.015, 1811.11720

  3. [3]

    Heinrich, S

    G. Heinrich, S. P . Jones, M. Kerner, V . Magerya, A. Olsson and J. Schlenk, Numerical scattering amplitudes with pySecDec , Comput. Phys. Commun. 295, 108956 (2024), doi:10.1016/j.cpc.2023.108956, 2305.19768

  4. [4]

    J. M. Cruz-Martinez, G. De Laurentis and M. Pellen, Accelerating Berends–Giele recursion for gluons in arbitrary dimensions over finite fields , Eur. Phys. J. C 85(5), 590 (2025), doi:10.1140/epjc/s10052-025-14318-3, 2502.07060. 24 SciPost Physics Codebases Submission

  5. [5]

    LHAPDF6: parton density access in the LHC precision era

    A. Buckley , J. Ferrando, S. Lloyd, K. Nordström, B. Page, M. Rüfenacht, M. Schönherr and G. Watt, LHAPDF6: parton density access in the LHC precision era, Eur. Phys. J. C 75, 132 (2015), doi:10.1140 /epjc/s10052-015-3318-8, 1412.7420

  6. [6]

    Carrazza, J

    S. Carrazza, J. M. Cruz-Martinez and M. Rossi,PDFFlow: Parton distribution functions on GPU, Comput. Phys. Commun. 264, 107995 (2021), doi:10.1016 /j.cpc.2021.107995, 2009.06635

  7. [7]

    Bothmann, T

    E. Bothmann, T . Childers, W . Giele, S. Höche, J. Isaacson and M. Knobbe, A portable parton-level event generator for the high-luminosity LHC, SciPost Phys.17(3), 081 (2024), doi:10.21468/SciPostPhys.17.3.081, 2311.06198

  8. [8]

    Hageböck, D

    S. Hageböck, D. Massaro, O. Mattelaer, S. Roiser, A. Valassi and Z. Wettersten, Data- parallel leading-order event generation in MadGraph5_aMC@NLO (2025), 2507.21039

  9. [9]

    Carrazza, J

    S. Carrazza, J. Cruz-Martinez, M. Rossi and M. Zaro, MadFlow: automating Monte Carlo simulation on GPU for particle physics processes , Eur. Phys. J. C 81(7), 656 (2021), doi:10.1140/epjc/s10052-021-09443-8, 2106.10279

  10. [10]

    com/cuda/cuda-c-programming-guide /index.html#, Accessed: 2025-08-11

    NVIDIA Corporation & affiliates, CUDA C++ Programming Guide, https: //docs.nvidia. com/cuda/cuda-c-programming-guide /index.html#, Accessed: 2025-08-11

  11. [11]

    M. H. Seymour and S. Sule, An algorithm to parallelise parton showers on a GPU, SciPost Phys. Codebases p. 33 (2024), doi:10.21468 /SciPostPhysCodeb.33

  12. [12]

    M. H. Seymour and S. Sule, Codebase release 1.1 for GAPS, SciPost Phys. Codebases pp. 33–r1.1 (2024), doi:10.21468/SciPostPhysCodeb.33-r1.1

  13. [13]

    Intel Coorportation, Intel Xeon Processor E5-2620 v4 (20M Cache, 2.10 GHz) Specifications, https: //www.intel.com/content/www/us/en/products/sku/92986/ intel-xeon-processor-e52620-v4-20m-cache-2-10-ghz /specifications.html, Accessed: 2025-09-06

  14. [14]

    NVIDIA Corporation & affiliates, Nvidia tesla v100 , https: //www.nvidia.com/en-gb/ data-center/v100/, Accessed: 2025-08-11

  15. [15]

    NVIDIA Corporation & affiliates, Thrust: The C ++ Parallel Algorithms Library , https: //nvidia.github.io/cccl/thrust/, Accessed: 2025-08-11

  16. [16]

    NVIDIA Corporation & affiliates, Nvidia a100 , https: //www.nvidia.com/en-gb/ data-center/a100/, Accessed: 2025-08-11

  17. [17]

    van Werkhoven, Kernel tuner: A search-optimizing gpu code auto-tuner, Future Generation Computer Systems 90, 347 (2019), doi:https://doi.org/10.1016/j.future.2018.08.004

    B. van Werkhoven, Kernel tuner: A search-optimizing gpu code auto-tuner, Future Generation Computer Systems 90, 347 (2019), doi:https://doi.org/10.1016/j.future.2018.08.004

  18. [18]

    Petroviˇc and J

    F . Petroviˇc and J. Filipovi ˇc, Kernel tuning toolkit , SoftwareX 22, 101385 (2023), doi:https://doi.org/10.1016/j.softx.2023.101385

  19. [19]

    The anti-k_t jet clustering algorithm

    M. Cacciari, G. P . Salam and G. Soyez,The anti-kt jet clustering algorithm, JHEP 04, 063 (2008), doi:10.1088 /1126-6708/2008/04/063, 0802.1189

  20. [20]

    Bierlich, A

    C. Bierlich, A. Buckley , J. M. Butterworth, C. Gütschow, L. Lönnblad, T . Procter, P . Richard- son and Y. Yeh, Robust independent validation of experiment and theory: Rivet version 4 release note, SciPost Phys. Codeb. 36, 1 (2024), doi:10.21468 /SciPostPhysCodeb.36, 2404.15984. 25 SciPost Physics Codebases Submission

  21. [21]

    Buckley , L

    A. Buckley , L. Corpe, M. Filipovich, C. Gütschow, N. Rozinsky , S. Thor, Y. Yeh and J. Yellen, Consistent, multidimensional differential histogramming and summary statistics with YODA 2, SciPost Phys. Codeb. 45 (2023), doi:10.21468 /SciPostPhysCodeb.45, 2312.15070

  22. [22]

    Bewick et al

    G. Bewick et al. , Herwig 7.3 Release Note , Eur. Phys. J. C 84(10), 1053 (2024), doi:10.1140/epjc/s10052-024-13211-9, 2312.05175

  23. [23]

    The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations

    J. Alwall, R. Frederix, S. Frixione, V . Hirschi, F . Maltoni, O. Mattelaer, H. S. Shao, T . Stelzer, P . Torrielli and M. Zaro,The automated computation of tree-level and next-to- leading order differential cross sections, and their matching to parton shower simulations , JHEP 07, 079 (2014), doi:10.1007 /JHEP07(2014)079, 1405.0301

  24. [24]

    Buccioni, J

    F . Buccioni, J. N. Lang, J. M. Lindert, P . Maierhöfer, S. Pozzorini, H. Zhang and M. F . Zoller, OpenLoops 2, Eur. Phys. J. C79(10), 866 (2019), doi:10.1140/epjc/s10052-019-7306-2, 1907.13071

  25. [25]

    Dipole Showers and Automated NLO Matching in Herwig++

    S. Plätzer and S. Gieseke, Dipole Showers and Automated NLO Matching in Herwig ++, Eur. Phys. J. C 72, 2187 (2012), doi:10.1140 /epjc/s10052-012-2187-7, 1109.6256

  26. [26]

    NVIDIA Corporation & affiliates, NVIDIA Nsight Systems, https: //developer.nvidia.com/ nsight-systems, Accessed: 2025-08-17 (2024)

  27. [27]

    Intel Coorportation, Intel Xeon Processor Gold 5220R (35.75M Cache, 2.20 GHz) Specifications, https: //www.intel.com/content/www/us/en/products/sku/199354/ intel-xeon-gold-5220r-processor-35-75m-cache-2-20-ghz /specifications.html, Ac- cessed: 2025-09-06

  28. [28]

    Lottick, S

    K. Lottick, S. Susai, S. A. Friedler and J. P . Wilson, Energy usage reports: Environmental awareness as part of algorithmic accountability, In Workshop on Tackling Climate Change with Machine Learning at NeurIPS 2019 (2019), 1911.08354

  29. [29]

    CodeCarbon Development Team, CodeCarbon: Track and Reduce Your Carbon Emissions from Computing, https: //codecarbon.io/, Accessed: 2025-08-17

  30. [30]

    nvidia.com/cuda-gpus#, Accessed: 2025-10-26

    NVIDIA Corporation & affiliates, CUDA GPU Compute Capability , https: //developer. nvidia.com/cuda-gpus#, Accessed: 2025-10-26

  31. [31]

    R. K. Ellis, W . J. Stirling and B. R. Webber, QCD and collider physics , vol. 8, Cambridge University Press, ISBN 978-0-511-82328-2, 978-0-521-54589-1, doi:10.1017/CBO9780511628788 (2011)

  32. [32]

    Dutch Book Arguments

    J. Campbell, J. Huston and F . Krauss, The Black Book of Quantum Chromodynam- ics : a Primer for the LHC Era , Oxford University Press, ISBN 978-0-19-965274-7, doi:10.1093/oso/9780199652747.001.0001 (2018)

  33. [33]

    General-purpose event generators for LHC physics

    A. Buckley et al., General-purpose event generators for LHC physics, Phys. Rept. 504, 145 (2011), doi:10.1016 /j.physrep.2011.03.005, 1101.2599

  34. [34]

    S. Höche, Introduction to parton-shower event generators , In Theoretical Advanced Study Institute in Elementary Particle Physics: Journeys Through the Precision Frontier: Amplitudes for Colliders , pp. 235–295, doi:10.1142 /9789814678766_0005 (2015), 1411.4085

  35. [35]

    Yu. L. Dokshitzer, Calculation of the Structure Functions for Deep Inelastic Scattering and e+ e- Annihilation by Perturbation Theory in Quantum Chromodynamics., Sov. Phys. JETP 46, 641 (1977). 26 SciPost Physics Codebases Submission

  36. [36]

    V . N. Gribov and L. N. Lipatov,Deep inelastic e p scattering in perturbation theory, Sov. J. Nucl. Phys. 15, 438 (1972)

  37. [37]

    Altarelli and G

    G. Altarelli and G. Parisi, Asymptotic Freedom in Parton Language , Nucl. Phys. B 126, 298 (1977), doi:10.1016 /0550-3213(77)90384-4

  38. [38]

    Lönnblad, ARIADNE version 4: A Program for simulation of QCD cascades implementing the color dipole model , Comput

    L. Lönnblad, ARIADNE version 4: A Program for simulation of QCD cascades implementing the color dipole model , Comput. Phys. Commun. 71, 15 (1992), doi:10.1016 /0010- 4655(92)90068-A

  39. [39]

    Nagy and D

    Z. Nagy and D. E. Soper, A New parton shower algorithm: Shower evolution, match- ing at leading and next-to-leading order level , In Ringberg Workshop on New Trends in HERA Physics 2005, pp. 101–123, doi:10.1142/9789812773524_0010 (2006), hep-ph/ 0601021

  40. [40]

    Parton showers from the dipole formalism

    M. Dinsdale, M. Ternick and S. Weinzierl,Parton showers from the dipole formalism, Phys. Rev. D76, 094003 (2007), doi:10.1103 /PhysRevD.76.094003, 0709.1026

  41. [41]

    A parton shower algorithm based on Catani-Seymour dipole factorisation

    S. Schumann and F . Krauss, A Parton shower algorithm based on Catani-Seymour dipole factorisation, JHEP 03, 038 (2008), doi:10.1088/1126-6708/2008/03/038, 0709.1027

  42. [42]

    Catani and M

    S. Catani and M. H. Seymour,A General algorithm for calculating jet cross-sections in NLO QCD, Nucl. Phys. B 485, 291 (1997), doi:10.1016/S0550-3213(96)00589-5, [Erratum: Nucl.Phys.B 510, 503–504 (1998)], hep-ph /9605323

  43. [43]

    J. C. Winter and F . Krauss, Initial-state showering based on colour dipoles connected to incoming parton lines , JHEP 07, 040 (2008), doi:10.1088 /1126-6708/2008/07/040, 0712.3913

  44. [44]

    Coherent Parton Showers with Local Recoils

    S. Plätzer and S. Gieseke, Coherent Parton Showers with Local Recoils , JHEP 01, 024 (2011), doi:10.1007 /JHEP01(2011)024, 0909.5593

  45. [45]

    Transverse-Momentum-Ordered Showers and Interleaved Multiple Interactions

    T . Sjöstrand and P . Z. Skands, Transverse-momentum-ordered showers and interleaved multiple interactions, Eur. Phys. J. C 39, 129 (2005), doi:10.1140 /epjc/s2004-02084-y, hep-ph/0408302

  46. [46]

    Dasgupta, F

    M. Dasgupta, F . A. Dreyer, K. Hamilton, P . F . Monni, G. P . Salam and G. Soyez,Parton showers beyond leading logarithmic accuracy , Phys. Rev. Lett. 125(5), 052002 (2020), doi:10.1103/PhysRevLett.125.052002, 2002.11114

  47. [47]

    J. R. Forshaw, J. Holguin and S. Plätzer, Building a consistent parton shower , JHEP 09, 014 (2020), doi:10.1007 /JHEP09(2020)014, 2003.06400

  48. [48]

    Herren, S

    F . Herren, S. Höche, F . Krauss, D. Reichelt and M. Schönherr,A new approach to color- coherent parton evolution, JHEP 10, 091 (2023), doi:10.1007/JHEP10(2023)091, 2208. 06057

  49. [49]

    Nagy and D

    Z. Nagy and D. E. Soper, Summations of large logarithms by parton showers , Phys. Rev. D 104(5), 054049 (2021), doi:10.1103 /PhysRevD.104.054049, 2011.04773

  50. [50]

    C. T . Preuss, A partitioned dipole-antenna shower with improved transverse recoil , JHEP 07, 161 (2024), doi:10.1007 /JHEP07(2024)161, 2403.19452

  51. [51]

    ExSample -- A Library for Sampling Sudakov-Type Distributions

    S. Plätzer, ExSample: A Library for Sampling Sudakov-Type Distributions, Eur. Phys. J. C 72, 1929 (2012), doi:10.1140 /epjc/s10052-012-1929-x, 1108.6182. 27 SciPost Physics Codebases Submission

  52. [52]

    M. Bähr, S. Gieseke and M. H. Seymour, Simulation of multiple partonic interactions in Herwig++, JHEP 07, 076 (2008), doi:10.1088 /1126-6708/2008/07/076, 0803.3633

  53. [53]

    M. H. Seymour,A Simple prescription for first order corrections to quark scattering and an- nihilation processes, Nucl. Phys. B 436, 443 (1995), doi:10.1016/0550-3213(94)00554- R, hep-ph /9410244

  54. [54]

    Plehn, D

    T . Plehn, D. Rainwater and P . Z. Skands,Squark and gluino production with jets , Phys. Lett. B 645, 217 (2007), doi:10.1016 /j.physletb.2006.12.009, hep-ph /0510144

  55. [55]

    Parton Shower Uncertainties with Herwig 7: Benchmarks at Leading Order

    J. Bellm, G. Nail, S. Plätzer, P . Schichtel and A. Siódmok, Parton Shower Uncertain- ties with Herwig 7: Benchmarks at Leading Order , Eur. Phys. J. C 76(12), 665 (2016), doi:10.1140/epjc/s10052-016-4506-x, 1605.01338. 28