pith. sign in

arxiv: 2512.04997 · v2 · submitted 2025-12-04 · ⚛️ physics.comp-ph

LEDDS: Portable LBM-DEM simulations on GPUs

Pith reviewed 2026-05-17 00:51 UTC · model grok-4.3

classification ⚛️ physics.comp-ph
keywords LBM-DEMGPU simulationsportable computingparallel primitivesgranular flowsfluid-particle couplingLattice Boltzmann MethodDiscrete Element Method
0
0 comments X

The pith

LBM-DEM simulations achieve hand-tuned CUDA performance using only portable parallel primitives on GPUs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces LEDDS as a framework for fully coupled Lattice Boltzmann and Discrete Element Method simulations on single-GPU platforms. It expresses the complete workflow, including neighbor search, collision detection, and fluid-particle coupling, as sequences of portable parallel primitives drawn from the C++ Standard Library and Thrust. The approach is validated across DEM and LBM-DEM benchmarks such as sphere and ellipsoid collisions, single-particle settling, Jeffery's orbits, and particle-laden shear flows. If correct, the work shows that high-level algorithmic formulations can deliver performance comparable to custom low-level code while preserving portability and readability. This would matter because it offers a route to maintainable multiphysics codes that transfer across GPU systems without repeated device-specific rewrites.

Core claim

LEDDS demonstrates that fully coupled LBM-DEM simulations can be performed by composing the entire workflow from algorithmic primitives rather than handcrafted GPU kernels. The full process of neighbor search, collision detection, and fluid-particle coupling is expressed as portable operations, and the resulting code runs efficiently on single GPUs. Benchmarks confirm that this high-abstraction implementation matches the performance of hand-tuned CUDA solvers while remaining portable to other HPC environments and keeping the source code clear.

What carries the argument

Algorithmic formulations that express computations as compositions of well-defined parallel primitives such as map, sort, and reduce, applied to the full LBM-DEM workflow.

If this is right

  • Performance comparable to hand-tuned CUDA solvers is achieved despite the high level of abstraction.
  • The framework maintains portability across a wide range of modern GPU systems and future accelerators.
  • Code clarity is preserved while delivering high performance for complex fluid-particle simulations.
  • LEDDS serves as a blueprint for portable multiphysics frameworks based on algorithmic primitives.
  • The method applies to both pure DEM and coupled LBM-DEM configurations across the listed validation cases.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Developers without specialized GPU programming expertise could more readily implement or modify such simulations.
  • The primitive-based abstraction might extend to additional multiphysics couplings such as fluid-structure interactions.
  • Standard libraries of parallel primitives could become a common foundation for scientific HPC codes.
  • Long-term maintenance costs for simulation software may decrease as hardware changes require fewer code rewrites.

Load-bearing premise

That expressing neighbor search, collision detection, and fluid-particle coupling entirely through portable parallel primitives introduces no significant performance penalty or accuracy loss across the range of DEM and LBM-DEM benchmarks considered.

What would settle it

A side-by-side run of one of the paper's benchmarks, such as particle-laden shear flow or single-particle settling, on identical hardware where the LEDDS version runs substantially slower or produces measurably different physical quantities than a hand-tuned CUDA implementation.

Figures

Figures reproduced from arXiv: 2512.04997 by Christophe Coreixas, Jonas Latt, Maxime Rambosson, Raphael Maggio-Aprile.

Figure 1
Figure 1. Figure 1: 2D schematic of the coupled particle-fluid simulation. Bold lines [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Intersection plane which characterizes a collision between two [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Overview of the LEDDS workflow from the particle perspective. Col [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Illustration of the 2D uniform grid. Spheres [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Three-step procedure for the global computation of all unique collision pairs, illustrated with an example (solids are represented by letters instead of [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: The particle-particle forces are computed during a parallel map operation applied to all particles. For the force acting on a given particle, a non-parallel [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Head-on elastic collision: Time evolution of the spheres’ longitudinal [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Head-on elastic collision: Time evolution of total [PITH_FULL_IMAGE:figures/full_fig_p012_8.png] view at source ↗
Figure 10
Figure 10. Figure 10: Setup of the simulation for the test. Before the collision, the sphere is [PITH_FULL_IMAGE:figures/full_fig_p013_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Evolution of the post-collision velocity angle (tan [PITH_FULL_IMAGE:figures/full_fig_p014_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Many-body validation: time evolution of the particle velocity distribution ( [PITH_FULL_IMAGE:figures/full_fig_p015_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Single particle settling simulation: Time evolution of the settling [PITH_FULL_IMAGE:figures/full_fig_p016_13.png] view at source ↗
Figure 15
Figure 15. Figure 15: Shear flow simulations: Instantaneous particle configurations for [PITH_FULL_IMAGE:figures/full_fig_p017_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: Evolution of the apparent viscosity (η/η0) as a function of the volume fraction (ϕ). Simulation data (symbols) are compared to the Krieger￾Dougherty semi-analytical law [73]. 5. Performance analysis To assess the computational efficiency and scalability of LEDDS, we perform a series of performance benchmarks us￾ing a fluidized-bed configuration across a range of solid vol￾ume fractions. The analysis is co… view at source ↗
Figure 17
Figure 17. Figure 17: CPU performance comparison of single (FP32) and double (FP64) [PITH_FULL_IMAGE:figures/full_fig_p019_17.png] view at source ↗
Figure 18
Figure 18. Figure 18: GPU performance comparison of a pure C++ STL implementa￾tion against an accelerated version, using the reduce_by_key algorithm and a radix-sort based version of sort from the Thrust library. Both implementations use double precision (FP64). 5.4. GPU performance Continuing with the GPU performance analysis of LEDDS, we first compare the use of STL algorithms and Thrust primi￾tives for the DEM and PSM-based… view at source ↗
Figure 19
Figure 19. Figure 19: GPU performance of the STL/Thrust version of LEDDS in sin￾gle (FP32) and double precision (FP64) across varying solid volume fractions. Both set of data were generated with the STL/Thrust version of LEDDS. double precision (FP64). In dilute configurations (ϕ = 0), where the solver is predominantly memory-bound due to the LBM updates, switching from FP64 to FP32 yields a significant performance gain simila… view at source ↗
Figure 21
Figure 21. Figure 21: GPU performance comparison between LEDDS and waLBerla [PITH_FULL_IMAGE:figures/full_fig_p021_21.png] view at source ↗
read the original abstract

Algorithmic formulations of GPU programs provide a high-level alternative to device-specific code by expressing computations as compositions of well-defined parallel primitives (e.g., map, sort, reduce), rather than through handcrafted GPU kernels. In this work, we demonstrate that this paradigm can be extended to complex and challenging problems in computational physics: the simulation of granular flows and fluid-particle interactions. LEDDS, our open-source framework, performs fully coupled Lattice Boltzmann -- Discrete Element Method (LBM-DEM) simulations using only algorithmic primitives, and runs efficiently on single-GPU platforms. The entire workflow, including neighbor search, collision detection, and fluid-particle coupling, is expressed as a sequence of portable primitives. While the current implementation illustrates these principles primarily through algorithms from the C++ Standard Library, with selective use of Thrust primitives for performance, the underlying concept is compatible with any HPC environment offering a rich set of parallel algorithms and is therefore applicable across a wide range of modern GPU systems and future accelerators. LEDDS is validated through benchmarks spanning both DEM and LBM-DEM configurations, including sphere and ellipsoid collisions, wall friction tests, single-particle settling, Jeffery's orbits, and particle-laden shear flows. Despite its high level of abstraction, LEDDS achieves performances comparable to those of hand-tuned CUDA solvers, while maintaining portability and code clarity. These results show that high-performance LBM-DEM coupling can be achieved without sacrificing generality or readability, establishing LEDDS as a blueprint for portable multiphysics frameworks based on algorithmic primitives.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 3 minor

Summary. The manuscript introduces LEDDS, an open-source framework for fully coupled LBM-DEM simulations on single-GPU platforms. All components—neighbor search, collision detection, and fluid-particle coupling—are expressed exclusively as compositions of portable parallel primitives drawn from the C++ Standard Library with selective Thrust usage. The work validates the approach on DEM and LBM-DEM benchmarks (sphere/ellipsoid collisions, wall friction, single-particle settling, Jeffery orbits, particle-laden shear flow) and claims that the resulting performance remains comparable to hand-tuned CUDA implementations while preserving portability and readability.

Significance. If the reported benchmark outcomes hold, the result is significant: it supplies a concrete, reproducible demonstration that complex multiphysics codes can be written at a high level of algorithmic abstraction without incurring prohibitive performance penalties. The open-source release, the explicit mapping of physical operations onto standard parallel primitives, and the absence of device-specific kernels constitute clear strengths that could serve as a template for portable HPC frameworks on current and future accelerators.

minor comments (3)
  1. §4 (Implementation): the description of how the neighbor-search primitive is composed from std::sort and std::transform should include a short pseudocode listing or explicit call sequence so that readers can reproduce the mapping without inspecting the source repository.
  2. Table 2 (Performance comparison): the reported wall-clock times for the LBM-DEM cases should be accompanied by the corresponding grid sizes, particle counts, and number of time steps to allow direct scaling comparisons with the cited hand-tuned CUDA references.
  3. §5.3 (Jeffery orbits): the statement that the computed orbits 'match analytical solutions' would be strengthened by reporting the L2 error norm or maximum angular deviation rather than a qualitative visual comparison alone.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive and accurate summary of the LEDDS framework, its use of portable algorithmic primitives for LBM-DEM coupling, and the recommendation for minor revision. The significance assessment aligns with our goals of demonstrating high-level abstractions without prohibitive performance loss.

Circularity Check

0 steps flagged

No significant circularity

full rationale

This is an implementation and benchmarking paper with no mathematical derivations, fitted parameters, predictions, or first-principles results. The central claims rest on reported benchmark outcomes (sphere/ellipsoid collisions, Jeffery orbits, particle-laden flows) and direct performance comparisons between the primitive-based approach and hand-tuned CUDA. No step reduces by construction to its own inputs, and the work is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that standard-library parallel primitives can implement the full set of LBM-DEM operations (neighbor search, collision handling, coupling) with acceptable efficiency and without custom kernels.

axioms (1)
  • domain assumption Parallel primitives available in C++ Standard Library and Thrust are sufficient to express neighbor search, collision detection, and fluid-particle coupling at high performance on GPUs.
    This assumption underpins both the portability claim and the performance comparison to hand-tuned CUDA.

pith-pipeline@v0.9.0 · 5578 in / 1280 out tokens · 65577 ms · 2026-05-17T00:51:14.110796+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

84 extracted references · 84 canonical work pages

  1. [1]

    P. A. Cundall, O. D. L. Strack, A discrete numerical model for granular assemblies, Géotechnique 29 (1) (1979) 47– 65. 22

  2. [2]

    H. Zhu, Z. Zhou, R. Yang, A. Yu, Discrete particle sim- ulation of particulate systems: Theoretical developments, Chem. Eng. Sci. 62 (13) (2007) 3378–3396, frontier of Chemical Engineering - Multi-scale Bridge between Re- ductionism and Holism

  3. [3]

    H. Zhu, Z. Zhou, R. Yang, A. Yu, Discrete particle simu- lation of particulate systems: A review of major applica- tions and findings, Chem. Eng. Sci. 63 (23) (2008) 5728– 5770

  4. [4]

    J. Zhao, T. Shan, Coupled CFD-DEM simulation of fluid- particle interaction in geomechanics, Powder Technol. 239 (2013) 248–258

  5. [5]

    Zhong, A

    W. Zhong, A. Yu, X. Liu, Z. Tong, H. Zhang, DEM/CFD- DEM Modelling of Non-spherical Particulate Systems: Theoretical Developments and Applications, Powder Technol. 302 (2016) 108–152

  6. [6]

    Kieckhefen, S

    P. Kieckhefen, S. Pietsch, M. Dosta, S. Heinrich, Possibil- ities and limits of computational fluid dynamics – discrete element method simulations in process eengineering: A review of recent advancements and future trends, Annual Review of Chemical and Biomolecular Engineering 11 (1) (2020) 397–422, pMID: 32169000

  7. [7]

    H. Ma, L. Zhou, Z. Liu, M. Chen, X. Xia, Y . Zhao, A review of recent development for the CFD-DEM investi- gations of non-spherical particles, Powder Technol. 412 (2022) 117972

  8. [8]

    J. Gan, Z. Zhou, A. Yu, A GPU-based DEM approach for modelling of particulate systems, Powder Technology 301 (2016) 1172–1182

  9. [9]

    Lu, GPU accelerated MFiX-DEM simulations of gran- ular and multiphase flows, Particuology 62 (2022) 14–24

    L. Lu, GPU accelerated MFiX-DEM simulations of gran- ular and multiphase flows, Particuology 62 (2022) 14–24

  10. [10]

    S. Fu, L. Wang, GPU-based unresolved LBM-DEM for fast simulation of gas–solid flows, Chemical Engineering Journal 465 (2023) 142898

  11. [11]

    Nickolls, I

    J. Nickolls, I. Buck, M. Garland, K. Skadron, Scalable parallel programming with CUDA: Is CUDA the paral- lel programming model that application developers have been waiting for?, Queue 6 (2) (2008) 40–53

  12. [12]

    HIP (C++Heterogeneous- Compute Interface for Portability): https://rocm.docs.amd.com/projects/HIP/en/latest/. [link]

  13. [13]

    Wienke, P

    S. Wienke, P. Springer, C. Terboven, D. an Mey, Ope- nACC – First Experiences with Real-World Applications, in: C. Kaklamanis, T. Papatheodorou, P. G. Spirakis (Eds.), Euro-Par 2012 Parallel Processing, Springer Berlin Heidelberg, Berlin, Heidelberg, 2012, pp. 859–870

  14. [14]

    Chandra, R

    R. Chandra, R. Menon, L. Dagum, D. Kohr, D. Maydan, J. McDonald, Parallel Programming in OpenMP, 1st Edi- tion, Elsevier, 2018

  15. [15]

    Munshi, The OpenCL specification, in: IEEE Hot Chips 21 Symposium (HCS), 2009, pp

    A. Munshi, The OpenCL specification, in: IEEE Hot Chips 21 Symposium (HCS), 2009, pp. 1–314

  16. [16]

    Alpay, V

    A. Alpay, V . Heuveline, SYCL beyond OpenCL: The ar- chitecture, current state and future direction of hipSYCL, in: Proceedings of the International Workshop on OpenCL, IWOCL ’20, Association for Computing Ma- chinery, New York, NY , USA, 2020

  17. [17]

    N. Bell, J. Hoberock, Thrust: A Productivity-Oriented Li- brary for CUDA, in: W. mei W. Hwu (Ed.), GPU Comput- ing Gems Jade Edition, Applications of GPU Computing Series, Morgan Kaufmann, Boston, 2012, pp. 359–371

  18. [18]

    Carter Edwards, C

    H. Carter Edwards, C. R. Trott, D. Sunderland, Kokkos: Enabling manycore performance portability through poly- morphic memory access patterns, Journal of Parallel and Distributed Computing 74 (12) (2014) 3202–3216, domain-Specific Languages and High-Level Frameworks for High-Performance Computing

  19. [19]

    G. E. Blelloch, Prefix sums and their applications, Tech. Rep. CMU-CS-90-190, Carnegie Mellon Univer- sity, Pittsburgh, PA, technical Report, School of Computer Science (1990)

  20. [20]

    Sengupta, M

    S. Sengupta, M. Harris, Y . Zhang, J. D. Owens, Scan primitives for gpu computing, in: Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS Symposium on Graphics Hardware, Eurographics Association, Aire- la-Ville, Switzerland, 2007, pp. 97–106

  21. [21]

    J. Shun, G. E. Blelloch, A simple and practical linear- work parallel algorithm for connectivity, in: Proceedings of the 26th ACM Symposium on Parallelism in Algo- rithms and Architectures (SPAA ’14), ACM, New York, NY , USA, 2014, pp. 143–153

  22. [22]

    Dhulipala, G

    D. Dhulipala, G. E. Blelloch, J. Shun, Theoretically effi- cient parallel graph algorithms can be fast and scalable, ACM Transactions on Parallel Computing 5 (1) (2018) 1– 49

  23. [23]

    Brahmakshatriya, A

    A. Brahmakshatriya, A. Brahmakshatriya, P. Sadayappan, J. D. Owens, Compilation techniques for graph algorithms on gpus, arXiv preprint arXiv:2012.07990 (2020)

  24. [24]

    Buluç, Linear algebraic primitives for parallel comput- ing on large graphs, Ph.D

    A. Buluç, Linear algebraic primitives for parallel comput- ing on large graphs, Ph.D. thesis, University of California, Santa Barbara (2010)

  25. [25]

    H. C. Edwards, C. R. Trott, D. Sunderland, Kokkos: En- abling manycore performance portability through poly- morphic memory access patterns, Journal of Parallel and Distributed Computing 74 (12) (2014) 3202–3216

  26. [26]

    AMD, Hipstdpar: Standard c++parallel algorithms on amd gpus,https://gpuopen.com/learn/ hipstdpar/, accessed: 2025-10-14 (2023). 23

  27. [27]

    Larkin, No More Porting: Coding for GPUs with Standard C++, Fortran, and Python, in: NVIDIA GTC (spring), 2022

    J. Larkin, No More Porting: Coding for GPUs with Standard C++, Fortran, and Python, in: NVIDIA GTC (spring), 2022

  28. [28]

    Larkin, A

    J. Larkin, A. Stulova, No more porting: Accelerated computing with Standard C++, Fortran, and Python, in: NVIDIA GTC (spring), 2024

  29. [29]

    J. Latt, C. Coreixas, J. Beny, Cross-platform programming model for many-core lattice Boltzmann simulations, PLoS One 16 (4) (2021) 1–29

  30. [30]

    J. Latt, C. Coreixas, Multi-GPU acceleration of PALA- BOS fluid solver using C++standard parallelism, arXiv preprint arXiv:2506.09242 (2025)

  31. [31]

    Coreixas, J

    C. Coreixas, J. Latt, Gpu-based compressible lattice boltz- mann simulations on non-uniform grids using standard c++parallelism: From best practices to aerodynamics, aeroacoustics and supersonic flow simulations, Computer Physics Communications 317 (2025) 109833

  32. [32]

    Coreixas, J

    C. Coreixas, J. Latt, X. Shan, No more porting: GPU acceleration of lattice Boltzmann and finite difference solvers via C++parallel algorithms, in: Symposium on high fidelity LES/DNS (HiFiLeD) for industrial relevant flow configurations, 2022

  33. [33]

    Rettinger, U

    C. Rettinger, U. Rüde, A comparative study of fluid– particle coupling methods for fully resolved lattice boltz- mann simulations, Computers & Fluids 154 (2017) 74–89

  34. [34]

    Rettinger, U

    C. Rettinger, U. Rüde, A coupled lattice boltzmann method and discrete element method for discrete particle simulations of particulate flows, Computers & Fluids 172 (2018) 706–719

  35. [35]

    Thorimbert, F

    Y . Thorimbert, F. Marson, A. Parmigiani, B. Chopard, J. Latt, Lattice boltzmann simulation of dense rigid spherical particle suspensions using immersed boundary method, Computers & Fluids 166 (2018) 286–294

  36. [36]

    D. R. Noble, J. R. Torczynski, A Lattice-Boltzmann Method for Partially Saturated Computational Cells, In- ternational Journal of Modern Physics C 09 (08) (1998) 1189–1201

  37. [37]

    Rettinger, U

    C. Rettinger, U. Rüde, An efficient four-way coupled lat- tice boltzmann – discrete element method for fully re- solved simulations of particle-laden flows, J. Comput. Phys. 453 (2022) 110942

  38. [38]

    Krüger, H

    T. Krüger, H. Kusumaatmaja, A. Kuzmin, O. Shardt, G. Silva, E. M. Viggen, The Lattice Boltzmann Method: Principles and Practice, Graduate Texts in Physics, Springer International Publishing, Cham, 2017

  39. [39]

    Kloss, C

    C. Kloss, C. Goniva, LIGGGHTS – Open source dis- crete element simulations of granular materials based on LAMMPS, John Wiley & Sons, Ltd, 2011, pp. 781–788

  40. [40]

    Lambert, L

    B. Lambert, L. Weynans, M. Bergmann, Methodology for numerical simulations of ellipsoidal particle-laden flows, International Journal for Numerical Methods in Fluids 92 (8) (2020) 855–873

  41. [41]

    R. D. Mindlin, Compliance of Elastic Bodies in Contact, Journal of Applied Mechanics 16 (3) (2021) 259–268

  42. [42]

    Maggio-Aprile, GPU-based simulation of dense sus- pensions: Coupling the DEM and the LBM in standard C++, Master thesis (2017)

    R. Maggio-Aprile, GPU-based simulation of dense sus- pensions: Coupling the DEM and the LBM in standard C++, Master thesis (2017)

  43. [43]

    experiments

    L. Verlet, Computer "experiments" on classical flu- ids. I. Thermodynamical properties of Lennard-Jones molecules, Phys. Rev. 159 (1967) 98–103

  44. [44]

    R. D. Skeel, Integration schemes for molecular dynam- ics and related applications, Springer Berlin Heidelberg, Berlin, Heidelberg, 1999, pp. 119–176

  45. [45]

    Shan, X.-F

    X. Shan, X.-F. Yuan, H. Chen, Kinetic theory representa- tion of hydrodynamics: A way beyond the Navier–Stokes equation, J. Fluid Mech. 550 (2006) 413–441

  46. [46]

    P. J. Dellar, An interpretation and derivation of the lattice Boltzmann method using Strang splitting, Comput. Math. Appl. 65 (2) (2013) 129 – 141

  47. [47]

    Schornbaum, U

    F. Schornbaum, U. Rüde, Massively parallel algorithms for the lattice Boltzmann method on nonuniform grids, SIAM J. Sci. Comput 38 (2) (2016) C96–C126

  48. [48]

    Marié, D

    S. Marié, D. Ricot, P. Sagaut, Comparison between lattice Boltzmann method and Navier–Stokes high or- der schemes for computational aeroacoustics, J. Comput. Phys. 228 (4) (2009) 1056 – 1070

  49. [49]

    A. Suss, I. Mary, T. Le Garrec, S. Marié, Comprehensive comparison between the lattice Boltzmann and Navier- Stokes methods for aerodynamic and aeroacoustic appli- cations, Comput. Fluids 257 (2023) 105881

  50. [50]

    P. L. Bhatnagar, E. P. Gross, M. Krook, A Model for Col- lision Processes in Gases. I. Small Amplitude Processes in Charged and Neutral One-Component Systems, Physi- cal Review 94 (3) (1954) 511–525, publisher: American Physical Society

  51. [51]

    Coreixas, B

    C. Coreixas, B. Chopard, J. Latt, Comprehensive compar- ison of collision models in the lattice Boltzmann frame- work: Theoretical investigations, Phys. Rev. E 100 (2019) 033305

  52. [52]

    Coreixas, G

    C. Coreixas, G. Wissocq, B. Chopard, J. Latt, Impact of collision models on the physical properties and the stabil- ity of lattice Boltzmann methods, Phil. Trans. R. Soc. A 378 (2175) (2020) 20190397

  53. [53]

    Ginzburg, F

    I. Ginzburg, F. Verhaeghe, D. d’Humières, Two- relaxation-time lattice Boltzmann scheme: About parametrization, velocity, pressure and mixed boundary conditions, Commun. Comput. Phys. 3 (2) (2008) 427– 478. 24

  54. [54]

    d’Humières, I

    D. d’Humières, I. Ginzburg, Viscosity independent nu- merical errors for lattice Boltzmann models: From recur- rence equations to “magic” collision numbers, Comput. Math. Appl. 58 (5) (2009) 823 – 840

  55. [55]

    Pan, L.-S

    C. Pan, L.-S. Luo, C. T. Miller, An evaluation of lattice Boltzmann schemes for porous medium flow simulation, Comput. Fluids 35 (8–9) (2006) 898 – 909, proceedings of the First International Conference for Mesoscopic Meth- ods in Engineering and Science

  56. [56]

    A. J. C. Ladd, Numerical simulations of particulate suspensions via a discretized Boltzmann equation. Part

  57. [57]

    Fluid Mech

    Theoretical foundation, J. Fluid Mech. 271 (1994) 285–309

  58. [58]

    Krüger, Computer Simulation Study of Collective Phe- nomena in Dense Suspensions of Red Blood Cells under Shear, Vieweg+Teubner Verlag, Wiesbaden, 2012

    T. Krüger, Computer Simulation Study of Collective Phe- nomena in Dense Suspensions of Red Blood Cells under Shear, Vieweg+Teubner Verlag, Wiesbaden, 2012

  59. [59]

    Rettinger, S

    C. Rettinger, S. Eibl, U. Rüde, B. V owinckel, Rheology of mobile sediment beds in laminar shear flow: effects of creep and polydispersity, Journal of Fluid Mechanics 932 (2022) A1

  60. [60]

    G. T. Nguyen, E. L. Chan, T. Tsuji, T. Tanaka, K. Washino, Resolved cfd–dem coupling simulation using volume penalisation method, Advanced Powder Technol- ogy 32 (1) (2021) 225–236

  61. [61]

    Tsigginos, J

    C. Tsigginos, J. Meng, X.-J. Gu, D. R. Emerson, Cou- pled LBM-DEM simulations using the partially saturated method: Theoretical and computational aspects, Powder Technology 405 (2022) 117556

  62. [62]

    B. D. Jones, J. R. Williams, Fast computation of accurate sphere-cube intersection volume, Engineering Computa- tions 34 (4) (2017) 1204–1216, publisher: Emerald Pub- lishing Limited

  63. [63]

    Kemmler, C

    S. Kemmler, C. Rettinger, U. Rüde, P. Cuéllar, H. Köstler, Efficiency and scalability of fully-resolved fluid-particle simulations on heterogeneous cpu-gpu architectures, The International Journal of High Performance Computing Applications 39 (3) (2025) 345–363

  64. [64]

    Seil, LBDEMcoupling: implementation, validation, and applications of a coupled open-source solver for fluid- particle systems/eingereicht von Dipl.-Ing

    P. Seil, LBDEMcoupling: implementation, validation, and applications of a coupled open-source solver for fluid- particle systems/eingereicht von Dipl.-Ing. Philippe Seil, 2016

  65. [65]

    Y . C. Chung, J. Y . Ooi, Benchmark tests for verifying discrete element modelling codes at particle impact level, Granular Matter 13 (5) (2011) 643–656

  66. [66]

    Rettinger, U

    C. Rettinger, U. Rüde, An efficient four-way coupled lat- tice Boltzmann - discrete element method for fully re- solved simulations of particle-laden flows, Tech. Rep. arXiv:2003.01490, arXiv, arXiv:2003.01490 [physics] type: article (Mar. 2020)

  67. [67]

    Doménech-Carbó, Analysis of rolling friction effects on oblique rebound by redefining tangential restitution and friction, Physics of Fluids 31 (4) (2019) 043302

    A. Doménech-Carbó, Analysis of rolling friction effects on oblique rebound by redefining tangential restitution and friction, Physics of Fluids 31 (4) (2019) 043302

  68. [68]

    S. F. Foerster, M. Y . Louge, H. Chang, K. Allia, Measure- ments of the collision properties of small spheres, Physics of Fluids 6 (3) (1994) 1108–1115

  69. [69]

    Boltzmann, Weitere studien über das wärmegle- ichgewicht unter gasmolekülen, Wien

    L. Boltzmann, Weitere studien über das wärmegle- ichgewicht unter gasmolekülen, Wien. Ber. 66 (1872) 275–370

  70. [70]

    Bhatnagar, E

    P. Bhatnagar, E. Gross, M. Krook, A model for colli- sion processes in gases. I. Small amplitude processes in charged and neutral one-component systems, Phys. Rev. 94 (1954) 511–525

  71. [71]

    ten Cate, C

    A. ten Cate, C. H. Nieuwstad, J. J. Derksen, H. E. A. Van den Akker, Particle imaging velocimetry experiments and lattice-Boltzmann simulations on a single sphere set- tling under gravity, Physics of Fluids 14 (11) (2002) 4012–4025

  72. [72]

    G. B. Jeffery, L. N. G. Filon, The motion of ellipsoidal particles immersed in a viscous fluid, Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character 102 (715) (1922) 161–179

  73. [73]

    Einstein, Eine neue bestimmung der moleküldimensio- nen, Annalen der Physik 324 (2) (1906) 289–306

    A. Einstein, Eine neue bestimmung der moleküldimensio- nen, Annalen der Physik 324 (2) (1906) 289–306

  74. [74]

    I. M. Krieger, T. J. Dougherty, A mechanism for non- newtonian flow in suspensions of rigid spheres, Transac- tions of The Society of Rheology 3 (1) (1959) 137–152

  75. [75]

    S. V . Lishchuk, I. Halliday, C. M. Care, Shear viscosity of bulk suspensions at low reynolds number with the three- dimensional lattice boltzmann method, Phys. Rev. E 74 (2006) 017701

  76. [76]

    Lorenz, A

    E. Lorenz, A. G. Hoekstra, A. Caiazzo, Lees-edwards boundary conditions for lattice boltzmann suspension simulations, Phys. Rev. E 79 (2009) 036706

  77. [77]

    Thorimbert, F

    Y . Thorimbert, F. Marson, A. Parmigiani, B. Chopard, J. Lätt, Lattice Boltzmann simulation of dense rigid spherical particle suspensions using immersed boundary method, Computers & Fluids (2018) 286–294

  78. [78]

    J. Guo, Q. Zhou, R. C.-K. Wong, Effects of volume frac- tion and particle shape on the rheological properties of oblate spheroid suspensions, Physics of Fluids 33 (8) (2021) 081703

  79. [79]

    I. M. Krieger, T. J. Dougherty, A Mechanism for Non- Newtonian Flow in Suspensions of Rigid Spheres, Trans- actions of The Society of Rheology 3 (1) (1959) 137–152. 25

  80. [80]

    Bhowmick, J

    T. Bhowmick, J. Latt, Y . Wang, G. Bagheri, Palabos turret: A particle-resolved numerical framework for settling dy- namics of arbitrary-shaped particles, Computers & Fluids 299 (2025) 106696

Showing first 80 references.