pith. sign in

arxiv: 2606.24154 · v1 · pith:3ZL52WSHnew · submitted 2026-06-23 · ⚛️ physics.ins-det · physics.comp-ph

neBEM: A GPU-accelerated Electrostatic Field Solver

Pith reviewed 2026-06-25 22:24 UTC · model grok-4.3

classification ⚛️ physics.ins-det physics.comp-ph
keywords neBEMGPUCUDAelectrostatic fieldMPGDGEMboundary element methodspace charge calculation
0
0 comments X

The pith

GPU acceleration makes the nearly exact Boundary Element Method practical for large detector simulations by delivering major speedups without accuracy loss.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper optimizes the nearly exact Boundary Element Method for calculating electric fields in Micro Pattern Gaseous Detectors. It introduces a hybrid approach with OpenMP on CPUs and CUDA on GPUs, featuring a new accelerated dynamic space charge calculation along with algorithmic improvements. The result is substantial speedups that keep the original accuracy intact. Verification on staggered thick Gas Electron Multiplier geometries shows matching results with commercial solvers, opening the way for rapid precise simulations of complex setups.

Core claim

The implementation of OpenMP and CUDA acceleration in neBEM, including the dynamic space charge calculation, achieves substantial speedups while preserving the solver's inherent accuracy, with simulations on staggered thick Gas Electron Multiplier geometries demonstrating agreement with other commercially available field solvers.

What carries the argument

The CUDA-accelerated dynamic space charge calculation within the optimized neBEM solver.

If this is right

  • Large-scale MPGD configurations can be simulated rapidly yet precisely.
  • Benchmarking shows significant speedup making the tool practical for detector design.
  • The accelerated version maintains fidelity as verified against commercial solvers.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar GPU strategies could speed up other analytical field computation methods in particle physics instrumentation.
  • Adoption in design workflows may allow iterative optimization of detector geometries that was previously computationally prohibitive.
  • Potential exists for extending the acceleration to time-dependent or more complex charge distributions.

Load-bearing premise

The CUDA implementation of the dynamic space charge calculation and the algorithmic optimizations introduce no numerical discrepancies or artifacts that would affect field accuracy.

What would settle it

A side-by-side computation on a staggered thick GEM geometry where the electric field values from the GPU-accelerated neBEM differ from those of a commercial solver by more than the tolerance expected from floating-point precision.

Figures

Figures reproduced from arXiv: 2606.24154 by Nayana Majumdar, Purba Bhattacharya, Shubhabrata Dutta, Supratik Mukhopadhyay, Tanay Dey.

Figure 1
Figure 1. Figure 1: Rectangular (Left) and triangular (Right) BEM meshes for a cylindrical surface. [PITH_FULL_IMAGE:figures/full_fig_p007_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Calculation of Potential and Field at a point [PITH_FULL_IMAGE:figures/full_fig_p010_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Hybrid execution model of neBEM process begins with the setup of the detector geometry using Garfield++’s native geometry tools. Once the geometry is defined, the neBEM interface proceeds through initialization, geometry reading, and discretization of the boundary surfaces into elements. The main computational workflow of the neBEM solver is initiated by the neBEMSolve() routine, which orchestrates the ent… view at source ↗
Figure 4
Figure 4. Figure 4: Workflow of neBEM 4.2. Space Charge Implementation: As previously discussed, the contribution of known space charges can be incorporated into the calculation by enabling the OptKnCh flags. There are two key stages in the solution pipeline where these contributions must be ac￾counted for: first, during the formation of the RH vector, and second, during the final evaluation of potential and field at any arbi… view at source ↗
Figure 5
Figure 5. Figure 5: Geometry of the parallel plate model as seen from the Garfield++ geometry [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: The basic unit cell used in neBEM, consisting of two cylindrical holes (left). [PITH_FULL_IMAGE:figures/full_fig_p023_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: The hexagonal unit cell used in COMSOL (left) and the visualization of the [PITH_FULL_IMAGE:figures/full_fig_p024_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Schematic representation of the single-layer THGEM detector setup, showing [PITH_FULL_IMAGE:figures/full_fig_p024_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Comparison of the electric field magnitude along a line parallel to the X-axis [PITH_FULL_IMAGE:figures/full_fig_p026_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Comparison of the electric field magnitude along the central axis (red dotted [PITH_FULL_IMAGE:figures/full_fig_p026_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Breakdown of execution time by kernel stage for various thread counts. The [PITH_FULL_IMAGE:figures/full_fig_p028_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Comparison of total wall-clock execution time for the THGEM simulation using [PITH_FULL_IMAGE:figures/full_fig_p029_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Evolution of the space-charge distribution during avalanche development in the [PITH_FULL_IMAGE:figures/full_fig_p033_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Axial electric field magnitude along the central axis of the THGEM hole in the [PITH_FULL_IMAGE:figures/full_fig_p034_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Electric field magnitude evaluated along a line parallel to the x-axis passing [PITH_FULL_IMAGE:figures/full_fig_p035_15.png] view at source ↗
read the original abstract

Accurate electric field estimation is critical for the design and optimization of Micro Pattern Gaseous Detectors (MPGDs). The nearly exact Boundary Element Method (neBEM) offers high precision field computation but is limited by long CPU runtime arising from its complex analytical formulations. This work presents a comprehensive optimization of the neBEM solver, focusing on a hybrid hardware acceleration strategy using OpenMP for multi-core CPUs and GPU acceleration using NVIDIA's CUDA. A key contribution is the new implementation of a dynamic space charge calculation, which has also been designed to be accelerated by CUDA. This primary acceleration is complemented by enhanced algorithmic optimizations to reduce the complexity of the problem. The proposed implementation achieves substantial speedups while preserving inherent accuracy of the solver. Simulations on staggered thick Gas Electron Multiplier geometries demonstrate agreement with other commercially available field solvers, verifying the fidelity of accelerated neBEM. Benchmarking tests show a significant speedup, enabling rapid yet precise simulations for complex MPGD configurations. These improvements make GPU-accelerated neBEM a practical tool for large-scale detector simulation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript presents a hybrid OpenMP/CUDA implementation of the nearly exact Boundary Element Method (neBEM) electrostatic solver, with a new dynamic space charge module also accelerated on GPU. It claims that the optimizations deliver substantial speedups while preserving the solver's inherent accuracy, as verified by agreement with commercial field solvers on staggered thick Gas Electron Multiplier (GEM) geometries. Benchmarking is said to show significant performance gains for complex MPGD simulations.

Significance. If the accuracy claims hold under rigorous numerical checks, the work would make high-precision neBEM simulations practical for large-scale MPGD design and optimization, addressing a known computational bottleneck in the field. The hybrid acceleration strategy and explicit CUDA port of dynamic space charge are potentially valuable contributions, though the absence of quantitative error metrics and isolated validation tests limits the assessed impact.

major comments (2)
  1. [Abstract] Abstract: the central claim that the CUDA implementation 'preserves inherent accuracy' while achieving speedups rests on agreement with commercial solvers on staggered thick GEM geometries, yet no quantitative error metrics (e.g., maximum field deviation, RMS residuals, or convergence rates) or benchmark tables are provided. This omission prevents assessment of whether the reported agreement is within acceptable tolerances for detector applications.
  2. [Abstract] Abstract (dynamic space charge section): the new CUDA implementation of dynamic space charge is highlighted as a key contribution, but validation relies solely on external commercial-solver comparisons. These cannot isolate potential numerical discrepancies introduced by the GPU port (e.g., reduced-precision reductions, atomic operations, or summation reordering). Direct CPU-vs-GPU residual checks on the space-charge component or analytical test cases are required to confirm numerical equivalence.
minor comments (2)
  1. The abstract refers to 'enhanced algorithmic optimizations' without specifying which algorithms were modified or how complexity was reduced; a dedicated methods subsection would improve clarity.
  2. Benchmarking results are mentioned but not detailed (e.g., hardware specifications, problem sizes, or speedup factors relative to the original CPU neBEM); inclusion of such data would strengthen the performance claims.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major comment below and will revise the paper to strengthen the validation sections.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that the CUDA implementation 'preserves inherent accuracy' while achieving speedups rests on agreement with commercial solvers on staggered thick GEM geometries, yet no quantitative error metrics (e.g., maximum field deviation, RMS residuals, or convergence rates) or benchmark tables are provided. This omission prevents assessment of whether the reported agreement is within acceptable tolerances for detector applications.

    Authors: We agree that quantitative error metrics are needed to allow readers to evaluate the accuracy claim rigorously. In the revised manuscript we will add a table (and accompanying text) reporting maximum field deviations, RMS residuals, and convergence behavior from the neBEM–commercial-solver comparisons on the staggered thick GEM geometries, together with a brief discussion of the tolerances relevant to MPGD design. revision: yes

  2. Referee: [Abstract] Abstract (dynamic space charge section): the new CUDA implementation of dynamic space charge is highlighted as a key contribution, but validation relies solely on external commercial-solver comparisons. These cannot isolate potential numerical discrepancies introduced by the GPU port (e.g., reduced-precision reductions, atomic operations, or summation reordering). Direct CPU-vs-GPU residual checks on the space-charge component or analytical test cases are required to confirm numerical equivalence.

    Authors: The referee correctly identifies that external comparisons alone do not isolate GPU-specific numerical effects. We will add, in the revised version, explicit CPU-versus-GPU residual comparisons for the dynamic space-charge module on representative test cases, including checks for summation and reduction consistency. Where feasible we will also include simple analytical benchmarks to confirm numerical equivalence. revision: yes

Circularity Check

0 steps flagged

Implementation report with external benchmarks; no derivation chain present

full rationale

The manuscript is an engineering/implementation paper describing CUDA/OpenMP acceleration of an existing neBEM solver plus a new dynamic space-charge module. Accuracy is asserted via direct numerical agreement with independent commercial field solvers on staggered thick GEM geometries, not via any internal derivation, fitted parameter, or self-citation chain. No equations, uniqueness theorems, or ansatzes are introduced that could reduce to the paper's own inputs. The central performance and fidelity claims therefore rest on external validation and benchmarking rather than self-referential construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities; the work rests on the unstated premise that the original neBEM formulation remains numerically stable under the described parallelization.

pith-pipeline@v0.9.1-grok · 5730 in / 1046 out tokens · 18056 ms · 2026-06-25T22:24:47.992549+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

22 extracted references · 17 canonical work pages

  1. [1]

    Bouclier, M

    R. Bouclier, M. Capeans, W. Dominik, M. Hoch, J.-C. Labbe, G. Mil- lion, L. Ropelewski, F. Sauli, A. Sharma, The gas electron multiplier (gem), IEEE Transactions on Nuclear Science 44 (3) (1997) 646–650. doi:10.1109/23.603726

  2. [2]

    Giomataris, P

    Y. Giomataris, P. Rebourgeard, J. Robert, G. Charpak, Mi- cromegas: a high-granularity position-sensitive gaseous detector for high particle-flux environments, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrom- eters, Detectors and Associated Equipment 376 (1) (1996) 29–35. doi:https://doi.org/10.1016/0168-9002(96)00175...

  3. [3]

    URLhttps://www.sciencedirect.com/science/article/pii/ S0168900222011305

    P.Roy, P.K.Rout, J.Datta, P.Bhattacharya, S.Mukhopadhyay, N.Ma- jumdar, S.Sarkar, Studyofspacechargephenomenaingem-baseddetec- tors, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment 1047 (2023) 167838.doi:https://doi.org/10.1016/j.nima.2022.167838. URLhttps://www.sciencedirect.c...

  4. [4]

    T. Dey, P. Bhattacharya, S. Mukhopadhyay, N. Majumdar, A. Seal, S. Chattopadhyay, Parallelization of garfield++ and nebem to simulate space-charge effects in rpcs, Computer Physics Communications 294 (2024) 108944.doi:https://doi.org/10.1016/j.cpc.2023.108944. URLhttps://www.sciencedirect.com/science/article/pii/ S0010465523002898

  5. [5]

    Bhattacharya, P

    P. Bhattacharya, P. Roy, T. Dey, J. Datta, P. K. Rout, N. Majumdar, S. Mukhopadhyay, Numerical simulation of charging up, accumulation of space charge and formation of discharges, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spec- trometers, Detectors and Associated Equipment 1075 (2025) 170336. doi:https://doi.org/10.1016/...

  6. [6]

    Garfield++,https://gitlab.cern.ch/garfield/garfieldpp, ac- cessed: 12.08.2025

  7. [7]

    S. Biagi, Monte carlo simulation of electron drift and diffusion in count- ing gases under the influence of electric and magnetic fields, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment 421 (1) (1999) 234–240.doi:https://doi.org/10.1016/S0168-9002(98)01233-9. URLhttps://www.sci...

  8. [8]

    I. Smirnov, Modeling of ionization produced by fast charged par- ticles in gases, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment 554 (1) (2005) 474–493.doi:https: //doi.org/10.1016/j.nima.2005.08.064. URLhttps://www.sciencedirect.com/science/article/pii/ S0168900205016724

  9. [9]

    Comsol multiphysics,https://www.comsol.co.in, accessed: 03.11.2025

  10. [10]

    Ansys,https://www.ansys.com, accessed: 03.11.2025

  11. [11]

    Muhkopadhyay, N

    S. Muhkopadhyay, N. Majumdar, Computation of 3d mems electrostatics using a nearly exact bem solver, Engineering Analysis with Boundary Elements 30 (8) (2006) 687–696. doi:https://doi.org/10.1016/j.enganabound.2006.03.002. URLhttps://www.sciencedirect.com/science/article/pii/ S0955799706000518

  12. [12]

    Majumdar, S

    N. Majumdar, S. Mukhopadhyay, Simulation of three-dimensional elec- trostaticfieldconfigurationinwirechambers: Anovelapproach, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment 566 (2) (2006) 489–494.doi:https://doi.org/10.1016/j.nima.2006.06.035. URLhttps://www.sciencedirect.com...

  13. [13]

    Majumdar, S

    N. Majumdar, S. Mukhopadhyay, Simulation of 3d electrostatic config- uration in gaseous detectors, Journal of Instrumentation 2 (09) (2007) P09006.doi:10.1088/1748-0221/2/09/P09006. URLhttps://doi.org/10.1088/1748-0221/2/09/P09006

  14. [14]

    Majumdar, S

    N. Majumdar, S. Mukhopadhyay, S. Bhattacharya, Computation of 3d electrostatic weighting field in resistive plate chambers, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment 595 (2) (2008) 346–352.doi:https://doi.org/10.1016/j.nima.2008.07.033. URLhttps://www.sciencedirect.com/sc...

  15. [15]

    Mukhopadhyay, N

    S. Mukhopadhyay, N. Majumdar, A study of three-dimensional edge and corner problems using the nebem solver, Engineer- ing Analysis with Boundary Elements 33 (2) (2009) 105–119. doi:https://doi.org/10.1016/j.enganabound.2008.06.003. URLhttps://www.sciencedirect.com/science/article/pii/ S0955799708001045

  16. [16]

    T. Neep, K. Nikolopoulos, M. Slater, Accelerating garfield++ with cuda, Journal of Instrumentation 20 (12) (2025) P12019.doi:10.1088/ 1748-0221/20/12/P12019. URLhttps://doi.org/10.1088/1748-0221/20/12/P12019

  17. [17]

    Dutta, P

    S. Dutta, P. Bhattacharya, T. Dey, N. Majumdar, S. Mukhopadhyay, Numerical simulation of space charge effects in mpgds, Journal of In- strumentation 20 (06) (2025) C06054.doi:10.1088/1748-0221/20/ 06/C06054. URLhttps://doi.org/10.1088/1748-0221/20/06/C06054

  18. [18]

    Openmp,https://www.openmp.org, accessed: 03.11.2025

  19. [19]

    Nvidia cuda toolkit,https://developer.nvidia.com/cuda-toolkit, accessed: 03.11.2025

  20. [20]

    Bhattacharya, S

    P. Bhattacharya, S. Biswas, B. Mohanty, N. Majumdar, S. Mukhopad- hyay, Detailed 3d simulation of the gem-based detector, Journal of Physics: Conference Series 759 (1) (2016) 012071.doi:10.1088/ 39 1742-6596/759/1/012071. URLhttps://doi.org/10.1088/1742-6596/759/1/012071

  21. [21]

    S. Biagi, A multiterm boltzmann analysis of drift velocity, diffusion, gain and magnetic-field effects in argon-methane-water-vapour mixtures, Nu- clear Instruments and Methods in Physics Research Section A: Accelera- tors, Spectrometers, DetectorsandAssociatedEquipment283(3)(1989) 716–722.doi:https://doi.org/10.1016/0168-9002(89)91446-0. URLhttps://www.s...

  22. [22]

    W. H. Furry, On fluctuation phenomena in the passage of high energy electrons through lead, Phys. Rev. 52 (1937) 569–581.doi:10.1103/ PhysRev.52.569. URLhttps://link.aps.org/doi/10.1103/PhysRev.52.569 40