pith. sign in

arxiv: 1411.0127 · v1 · pith:MWJF4JS3new · submitted 2014-11-01 · ❄️ cond-mat.dis-nn · physics.comp-ph

Highly optimized simulations on single- and multi-GPU systems of 3D Ising spin glass

classification ❄️ cond-mat.dis-nn physics.comp-ph
keywords implementationmulti-gpucubiccudahighlyisingmetropolismodel
0
0 comments X
read the original abstract

We present a highly optimized implementation of a Monte Carlo (MC) simulator for the three-dimensional Ising spin-glass model with bimodal disorder, i.e., the 3D Edwards-Anderson model running on CUDA enabled GPUs. Multi-GPU systems exchange data by means of the Message Passing Interface (MPI). The chosen MC dynamics is the classic Metropolis one, which is purely dissipative, since the aim was the study of the critical off-equilibrium relaxation of the system. We focused on the following issues: i) the implementation of efficient access patterns for nearest neighbours in a cubic stencil and for lagged-Fibonacci-like pseudo-Random Numbers Generators (PRNGs); ii) a novel implementation of the asynchronous multispin-coding Metropolis MC step allowing to store one spin per bit and iii) a multi-GPU version based on a combination of MPI and CUDA streams. We highlight how cubic stencils and PRNGs are two subjects of very general interest because of their widespread use in many simulation codes. Our code best performances ~3 and ~5 psFlip on a GTX Titan with our implementations of the MINSTD and MT19937 respectively.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Microcanonical simulated annealing: Massively parallel Monte Carlo simulations with sporadic random-number generation

    cond-mat.stat-mech 2025-06 conditional novelty 7.0

    MicSA reduces random-number generation in Monte Carlo simulations for 3D Ising spin glasses, supporting massively parallel GPU execution whose dynamics map to standard results via time rescaling.