Recognition: unknown
Experiences with OpenMP in tmLQCD
read the original abstract
An overview is given of the lessons learned from the introduction of multi-threading using OpenMP in tmLQCD. In particular, programming style, performance measurements, cache misses, scaling, thread distribution for hybrid codes, race conditions, the overlapping of communication and computation and the measurement and reduction of certain overheads are discussed. Performance measurements and sampling profiles are given for different implementations of the hopping matrix computational kernel.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
Charmonium radiative transitions to dileptons from lattice QCD: The case of $h_c \to \eta_c \ell^+\ell^-$ and $\chi_{c1} \to J/\psi\,\ell^+\ell^-$
First fully dynamical lattice QCD yields Γ(h_c → η_c e⁺e⁻) = 5.45(19) keV (3σ above BESIII) and Γ(χ_c1 → J/ψ e⁺e⁻) = 2.869(90) keV, with continuum-extrapolated results and q² distributions.
-
Scalar and Tensor Form Factors for $\Lambda \rightarrow p\ell \bar{\nu}_\ell$ from Lattice QCD
Lattice QCD yields the scalar and tensor form factors for Λ→pℓν̄ℓ as functions of q², providing a model-independent input to constrain non-standard charged-current interactions via the predicted R^{μe} ratio compared ...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.