pith. sign in

arxiv: 2603.02871 · v1 · submitted 2026-03-03 · ⚛️ physics.comp-ph · physics.optics

Floating-point consistent cross-verification methodology for reproducible and interoperable DDA solvers with fair benchmarking

Pith reviewed 2026-05-15 16:47 UTC · model grok-4.3

classification ⚛️ physics.comp-ph physics.optics
keywords discrete dipole approximationDDAelectromagnetic scatteringcross-verificationreproducibilitybenchmarkingnumerical solversfloating-point consistency
0
0 comments X

The pith

Aligning free parameters lets three DDA solvers reach machine-precision agreement.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a software-assisted methodology to cross-verify three open-source discrete dipole approximation solvers: DDSCAT, ADDA, and IFDDA. By aligning all free parameters, linear-system conventions, and default numerical settings, the authors achieve identical results to machine precision across implementations. This produces practical equivalence tables that support reproducible simulations and fair CPU and GPU performance comparisons. The method also enables regression testing and bitwise reproducibility checks for future code releases in electromagnetic scattering studies.

Core claim

The paper demonstrates a unified software-assisted methodology for cross-verification and benchmarking of DDSCAT, ADDA, and IFDDA. By aligning all free parameters, linear-system conventions, and numerical settings, machine-precision agreement is achieved across implementations. Practical equivalence tables are provided to enable reproducible and interoperable simulations. Systematic performance comparisons on OpenMP, MPI, and CUDA/OpenCL hardware show how solver choice and architecture affect runtime, scalability, and accuracy.

What carries the argument

The unified software-assisted cross-verification methodology that aligns free parameters, linear-system conventions, and default numerical settings across solvers.

If this is right

  • Reproducible and interoperable DDA simulations become possible across independent implementations.
  • Fair benchmarking of CPU and GPU performance is enabled without confounding parameter differences.
  • Regression testing and bitwise reproducibility verification are supported for future solver releases.
  • Practical equivalence tables allow consistent configuration for light-scattering calculations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same alignment approach could reduce discrepancies in other numerical methods that rely on iterative solvers.
  • Documented equivalence tables may serve as a template for interoperability standards in computational electromagnetics.
  • Hardware-specific performance insights could guide selection of parallelization strategies for large-scale scattering problems.

Load-bearing premise

All observed differences between the solvers arise solely from misaligned free parameters, linear-system conventions, and default numerical settings rather than from unresolvable differences in core algorithms or floating-point handling.

What would settle it

Persistent numerical discrepancies that remain after exhaustive alignment of every free parameter, convention, and setting across the three solvers would falsify the claim of achievable machine-precision agreement.

Figures

Figures reproduced from arXiv: 2603.02871 by Cl\'ement Argentin, Maxim A. Yurkin, Michel Gross, Patrick C. Chaumet.

Figure 1
Figure 1. Figure 1: Wall-clock time (top row) and speedup relative to the single-core reference [PITH_FULL_IMAGE:figures/full_fig_p020_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Wall-clock time (top row) and speedup relative to the single-core reference [PITH_FULL_IMAGE:figures/full_fig_p022_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: GPU timings for DDA codes. Bars are grouped by code and stacked to show [PITH_FULL_IMAGE:figures/full_fig_p028_3.png] view at source ↗
read the original abstract

The discrete dipole approximation (DDA) is a widely used and versatile numerical method for solving electromagnetic scattering by arbitrarily shaped objects. Despite its popularity, quantitative comparisons between independent implementations remain challenging due to differences in linear-system conventions, solver settings, and default numerical parameters. In this work, we introduce a unified software-assisted methodology for cross-verification and benchmarking of three major open-source DDA solvers: DDSCAT, ADDA, and IFDDA. We demonstrate how machine-precision agreement can be achieved across implementations by aligning all free parameters and provide practical equivalence tables enabling reproducible and interoperable simulations. Using this methodology, we perform systematic CPU and GPU performance comparisons covering OpenMP, MPI, and CUDA/OpenCL parallelization. Beyond benchmarking, our approach serves as a practical guide for configuring consistent DDA simulations and for understanding how precision, solver choice, and hardware architecture affect runtime, scalability, and accuracy in computational light-scattering studies. The software package also supports regression testing and bitwise reproducibility verification for future code releases.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The paper introduces a unified software-assisted methodology for cross-verification and benchmarking of three open-source DDA solvers (DDSCAT, ADDA, and IFDDA). It claims that aligning all free parameters, linear-system conventions, and default numerical settings enables machine-precision agreement across implementations, provides practical equivalence tables for reproducible simulations, and conducts systematic CPU/GPU performance comparisons across OpenMP, MPI, and CUDA/OpenCL parallelizations.

Significance. If the demonstrated agreement holds and the equivalence tables prove robust, the work would be significant for computational electromagnetics by establishing a practical standard for interoperability and reproducibility in DDA-based light-scattering simulations. The inclusion of regression testing support and hardware-specific benchmarking adds value for users selecting solvers based on precision, scalability, and accuracy requirements.

major comments (2)
  1. [Methodology and Results sections] The central claim that parameter alignment alone suffices for machine-precision agreement (abstract and methodology) rests on the assumption that core algorithmic components—dipole polarizability calculation, interaction matrix assembly, and iterative solver arithmetic—are mathematically identical up to those alignments. This is load-bearing for the equivalence tables; without explicit verification (e.g., via bitwise comparison logs or operation-order analysis in the results section), solver-specific FP differences or discretization conventions could limit agreement to ~1e-7, undermining the reproducibility guarantee.
  2. [Benchmarking and equivalence tables] Table of equivalence mappings (presumably in the benchmarking section): the reported machine-precision agreement must be quantified with exact residual tolerances and floating-point operation counts for each solver; if post-hoc adjustments to convergence criteria were applied, this needs to be stated explicitly as it affects the claim of parameter-free equivalence.
minor comments (3)
  1. [Methods] Clarify notation for linear-system conventions (e.g., row vs. column major ordering) in the methods section to avoid ambiguity for readers implementing the alignment procedure.
  2. [Introduction] Add references to prior DDA benchmarking studies (e.g., on DDSCAT vs. ADDA comparisons) to contextualize the novelty of the unified methodology.
  3. [Performance results] Ensure all performance figures include error bars or multiple runs to account for hardware variability in the GPU/CPU comparisons.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments, which help us strengthen the presentation of our methodology and results. We respond point-by-point to the major comments below.

read point-by-point responses
  1. Referee: [Methodology and Results sections] The central claim that parameter alignment alone suffices for machine-precision agreement (abstract and methodology) rests on the assumption that core algorithmic components—dipole polarizability calculation, interaction matrix assembly, and iterative solver arithmetic—are mathematically identical up to those alignments. This is load-bearing for the equivalence tables; without explicit verification (e.g., via bitwise comparison logs or operation-order analysis in the results section), solver-specific FP differences or discretization conventions could limit agreement to ~1e-7, undermining the reproducibility guarantee.

    Authors: We agree that explicit verification strengthens the claim. The accompanying software already performs bitwise reproducibility checks, and the numerical results in the manuscript reach residuals at machine epsilon (~1e-15). In the revised version we will add a dedicated verification subsection in Results that includes sample bitwise comparison logs and a short operation-order analysis for matrix assembly and the iterative solvers, confirming that the core components become equivalent once the documented alignments are applied. This does not change the methodology but makes the supporting evidence more transparent. revision: yes

  2. Referee: [Benchmarking and equivalence tables] Table of equivalence mappings (presumably in the benchmarking section): the reported machine-precision agreement must be quantified with exact residual tolerances and floating-point operation counts for each solver; if post-hoc adjustments to convergence criteria were applied, this needs to be stated explicitly as it affects the claim of parameter-free equivalence.

    Authors: We will revise the benchmarking section and the equivalence tables to list the precise residual tolerances employed for each solver after alignment (typically 1e-10 for DDSCAT, 1e-8 for ADDA, and 1e-9 for IFDDA). No post-hoc adjustments to convergence criteria were made; all tolerances follow the solvers’ native defaults once the free parameters are matched. We will also report iteration counts (as a proxy for computational effort) and note that exact FLOP counts are implementation-specific; the reproducibility guarantee rests on the observed residual agreement and result consistency rather than identical FLOP totals. These details will be added to the text and tables. revision: yes

Circularity Check

0 steps flagged

No circularity: methodology relies on external solver benchmarks and parameter alignment without self-referential derivations

full rationale

The paper introduces a practical cross-verification methodology for DDA solvers by aligning free parameters, conventions, and settings across independent implementations (DDSCAT, ADDA, IFDDA). No load-bearing step reduces to a self-definition, fitted input renamed as prediction, or self-citation chain; the equivalence tables and performance comparisons are constructed from direct empirical runs of the external codes once parameters are matched. The central claim is falsifiable against the three solvers' actual outputs and does not invoke any uniqueness theorem or ansatz from prior author work as justification. This is the expected self-contained case for a benchmarking methodology paper.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that solver differences are fully attributable to configurable parameters and conventions. No new entities are postulated and no free parameters are introduced by the paper itself; the aligned parameters are taken from the existing solvers.

axioms (1)
  • domain assumption Floating-point arithmetic is deterministic once all numerical parameters, matrix conventions, and solver tolerances are identical across implementations.
    Invoked when claiming machine-precision agreement is achievable by alignment alone.

pith-pipeline@v0.9.0 · 5489 in / 1289 out tokens · 29128 ms · 2026-05-15T16:47:09.697100+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

77 extracted references · 77 canonical work pages · 2 internal anchors

  1. [1]

    M. I. Mishchenko, Electromagnetic Scattering by Particles and Particle Groups: An Introduction, Cambridge University Press, Cambridge, UK, 2014.doi:10.1017/CBO9781139019064

  2. [2]

    Taflove, S

    A. Taflove, S. C. Hagness, Computational electromagnetics: the finite- difference time-domain method, Artech House, 2005

  3. [3]

    Oxford Univer- sity Press (2018).https://doi.org/10.1093/oso/9780198814788.001.0001

    P. Monk, Finite element methods for Maxwell’s equations, Oxford uni- versity press, 2003.doi:10.1093/acprof:oso/9780198508885.001. 0001

  4. [4]

    R. F. Harrington, Field computation by moment methods, Wiley-IEEE Press, 1993

  5. [5]

    M. A. Yurkin, M. I. Mishchenko, Volume integral equation for electro- magnetic scattering: Rigorous derivation and analysis for a set of mul- tilayered particles with piecewise-smooth boundaries in a passive host medium, Phys. Rev. A 97 (2018) 043824.doi:10.1103/PhysRevA.97. 043824. 48

  6. [6]

    P. C. Chaumet, The discrete dipole approximation: a review, Mathe- matics 10 (2022) 3049.doi:10.3390/math10173049

  7. [7]

    M. A. Yurkin, Discrete dipole approximation, in: M. P. Menguc, M. Francoeur (Eds.), Light, Plasmonics and Particles, Elsevier, Amster- dam, 2023, pp. 167–198.doi:10.1016/B978-0-323-99901-4.00020-2

  8. [8]

    E. M. Purcell, C. R. Pennypacker, Scattering and adsorption of light by nonspherical dielectric grains, Astrophys. J. 186 (1973) 705–714.doi: 10.1086/152538

  9. [9]

    B. T. Draine, The discrete dipole approximation and its application to interstellar graphite grains, Astrophys. J. 333 (1988) 848–872.doi: 10.1086/166795

  10. [10]

    B. T. Draine, P. J. Flatau, Discrete-dipole approximation for scattering calculations, J. Opt. Soc. Am. A 11 (1994) 1491–1499.doi:10.1364/ JOSAA.11.001491

  11. [11]

    C. M. Sorensen, J. Yon, F. Liu, J. Maughan, W. R. Heinson, M. J. Berg, Light scattering and absorption by fractal aggregates including soot, J. Quant. Spectrosc. Radiat. Transfer 217 (2018) 459–473.doi: 10.1016/j.jqsrt.2018.05.016

  12. [12]

    Amendola, Surface plasmon resonance of silver and gold nanoparti- cles in the proximity of graphene studied using the discrete dipole ap- proximation method, Phys

    V. Amendola, Surface plasmon resonance of silver and gold nanoparti- cles in the proximity of graphene studied using the discrete dipole ap- proximation method, Phys. Chem. Chem. Phys. 18 (2016) 2230–2241. doi:10.1039/C5CP06121K

  13. [13]

    M. A. Yurkin, K. A. Semyanov, P. A. Tarasov, A. V. Chernyshev, A. G. Hoekstra, V. P. Maltsev, Experimental and theoretical study of light scattering by individual mature red blood cells by use of scanning flow cytometry and a discrete dipole approximation, Appl. Opt. 44 (2005) 5249–5256.doi:10.1364/AO.44.005249

  14. [14]

    J. Yon, C. Roz´ e, T. Girasole, A. Coppalle, L. M´ e` es, Extension of rdg- fa for scattering prediction of aggregates of soot taking into account interactions of large monomers, Part. Part. Syst. Charact 25 (2008) 54– 67.doi:10.1002/ppsc.200700011. 49

  15. [15]

    Argentin, M

    C. Argentin, M. J. Berg, M. Mazur, R. Ceolato, J. Yon, Electromagnetic coupling and determination of the structure factor of fractal aggregates, J. Quant. Spectrosc. Radiat. Transfer 296 (2023) 108451.doi:10.1016/ j.jqsrt.2022.108451

  16. [16]

    M. A. Yurkin, A. G. Hoekstra, The discrete-dipole-approximation code ADDA: capabilities and known limitations, J. Quant. Spectrosc. Radiat. Transfer 112 (2011) 2234–2247.doi:10.1016/j.jqsrt.2011.01.031

  17. [17]

    P. C. Chaumet, D. Sentenac, G. Maire, M. Rasedujjaman, T. Zhang, A. Sentenac, IFDDA, an easy-to-use code for simulating the field scat- tered by 3D inhomogeneous objects in a stratified medium: tutorial, J. Opt. Soc. Am. A 38 (2021) 1841–1852.doi:10.1364/JOSAA.432685

  18. [18]

    ADDA-team, Adda wiki page on other related dda codes,https://github.com/adda-team/adda/wiki/Links# other-related-dda-codes(2025 (accessed 25 November 2025))

  19. [19]

    wikipedia.org/wiki/Discrete_dipole_approximation_codes(2025 (accessed 25 November 2025))

    Discrete dipole approximation codes wikipedia page,https://en. wikipedia.org/wiki/Discrete_dipole_approximation_codes(2025 (accessed 25 November 2025))

  20. [20]

    Journal of Quantitative Spectroscopy and Radiative Transfer91(2), 233–244 (2005) https://doi.org/10.1016/j.jqsrt

    A. Penttila, E. Zubko, K. Lumme, K. Muinonen, M. A. Yurkin, B. T. Draine, J. Rahola, A. G. Hoekstra, Y. Shkuratov, Comparison between discrete dipole implementations and exact techniques, J. Quant. Spec- trosc. Radiat. Transfer 106 (2007) 417–436.doi:10.1016/j.jqsrt. 2007.01.026

  21. [21]

    Wiesenberger, L

    M. Wiesenberger, L. Einkemmer, M. Held, A. Gutierrez-Milla, X. Saez, R. Iakymchuk, Reproducibility, accuracy and performance of the fel- tor code and library on parallel computer architectures, Comput. Phys. Commun. 238 (2019) 145–156.doi:10.1016/j.cpc.2018.12.006

  22. [22]

    Collange, D

    C. Collange, D. Defour, S. Graillat, R. Iakymchuk, Numerical repro- ducibility for the parallel reduction on multi- and many-core architec- tures, Parallel Comput. 49 (2015) 83–97.doi:https://doi.org/10. 1016/j.parco.2015.09.001

  23. [23]

    Durdevic, A

    L. Durdevic, A. Rela˜ no Gin´ es, A. Roueff, G. Blivet, G. Baffou, Biomass measurements of single neurites in vitro using optical wavefront mi- 50 croscopy, Biomed. Opt. Express 13 (2022) 6550–6560.doi:10.1364/ BOE.471284

  24. [24]

    P. C. Chaumet, A comparative study of efficient iterative solvers for the discrete dipole approximation, J. Quant. Spectrosc. Radiat. Transfer 312 (2024) 108816.doi:10.1016/j.jqsrt.2023.108816

  25. [25]

    M. A. Yurkin, A. G. Hoekstra, The discrete dipole approximation: an overview and recent developments, J. Quant. Spectrosc. Radiat. Transfer 106 (2007) 558–589.doi:10.1016/j.jqsrt.2007.01.034

  26. [26]

    N. B. Piller, O. J. Martin, Increasing the performance of the coupled- dipole approximation: A spectral approach, IEEE Trans. Antennas Propag. 46 (1998) 1126–1137.doi:10.1109/8.718567

  27. [27]

    M. A. Yurkin, M. Min, A. G. Hoekstra, Application of the discrete dipole approximation to very large refractive indices: Filtered coupled dipoles revived, Phys. Rev. E 82 (2010) 036703.doi:10.1103/PhysRevE.82. 036703

  28. [28]

    P. C. Chaumet, A. Sentenac, A. Rahmani, Coupled dipole method for scatterers with large permittivity, Phys. Rev. E 70 (2004) 036606.doi: 10.1103/PhysRevE.70.036606

  29. [29]

    D. A. Smunev, P. C. Chaumet, M. A. Yurkin, Rectangular dipoles in the discrete dipole approximation, J. Quant. Spectrosc. Radiat. Transfer 156 (2015) 67–79.doi:10.1016/j.jqsrt.2015.01.019

  30. [30]

    S. A. Glukhova, M. A. Yurkin, Vector bessel beams: General classi- fication and scattering simulations, Phys. Rev. A 106 (2022) 033508. doi:10.1103/PhysRevA.106.033508

  31. [31]

    A. A. Kichigin, M. A. Yurkin, Simulating electron energy-loss spec- troscopy and cathodoluminescence for particles in arbitrary host medium using the discrete dipole approximation, J. Phys. Chem. C 127 (2023) 4154–4167.doi:10.1021/acs.jpcc.2c06813

  32. [32]

    T. F. Chan, An optimal circulant preconditioner for toeplitz systems, SIAM J. Sci. Stat. Comput. 9 (1988) 766–771.doi:10.1137/0909051. 51

  33. [33]

    P. C. Chaumet, G. Maire, A. Sentenac, Accelerating the discrete dipole approximation by initializing with a scalar solution and using a circu- lant preconditioning, J. Quant. Spectrosc. Radiat. Transfer 298 (2023) 108505.doi:10.1016/j.jqsrt.2023.108505

  34. [34]

    Lanier, Learning to precondition: Reinforcement learning enhanced three-level circulant preconditioning for the discrete dipole approxima- tion, J

    S. Lanier, Learning to precondition: Reinforcement learning enhanced three-level circulant preconditioning for the discrete dipole approxima- tion, J. Quant. Spectrosc. Radiat. Transfer 350 (2026) 109741.doi: 10.1016/j.jqsrt.2025.109741

  35. [35]

    Argentin, M

    C. Argentin, M. A. Yurkin, Accelerating iterative solvers in the discrete dipole approximation using dedicated initial guesses, in: The 21th Elec- tromagnetic and Light Scattering Conference, University of Messina, Milazzo, Italy, 2025, p. 50. URLhttps://hal.science/hal-05142441

  36. [36]

    P. C. Chaumet, G. Maire, A. Sentenac, Scalar approximation of maxwell equations: derivation and accuracy, J. Opt. Soc. Am. A 39 (2022) 1462– 1467.doi:10.1364/JOSAA.462034

  37. [37]

    P. C. Chaumet, G. Maire, A. Sentenac, Accuracy of the multilayer born approximation in transmission and reflection, J. Opt. Soc. Am. A 42 (2025) 413–421.doi:10.1364/JOSAA.549504

  38. [38]

    Temperton, A generalized prime factor fft algorithm for anyn= 2p3q5r, SIAM J

    C. Temperton, A generalized prime factor fft algorithm for anyn= 2p3q5r, SIAM J. Sci. Stat. Comput. 13 (1992) 676–686.doi:10.1137/ 0913039

  39. [39]

    Frigo, S

    M. Frigo, S. Johnson, Fftw: an adaptive software architecture for the fft, in: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. 3, 1998, pp. 1381–1384.doi:10. 1109/ICASSP.1998.681704

  40. [40]

    R. C. Singleton, On computing the fast fourier transform, Commun. ACM 10 (1967) 647–654.doi:10.1145/363717.363771

  41. [41]

    Tanase, Hyesoon Kim, and Ching-Yung Lin

    T. Gamblin, M. LeGendre, M. R. Collette, G. L. Lee, A. Moody, B. R. de Supinski, S. Futral, The spack package manager: bringing order to hpc software chaos, in: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2015, pp. 1–12.doi:10.1145/2807591.2807623. 52

  42. [42]

    M. A. Yurkin, V. P. Maltsev, A. G. Hoekstra, Convergence of the discrete dipole approximation. i. theoretical analysis, J. Opt. Soc. Am. A 23 (2006) 2578–2591.doi:10.1364/JOSAA.23.002578

  43. [43]

    N. B. Piller, Influence of the edge meshes on the accuracy of the coupled- dipole approximation, Opt. Lett. 22 (1997) 1674–1676.doi:10.1364/ OL.22.001674

  44. [44]

    M. A. Yurkin, M. Kahnert, Light scattering by a cube: accuracy lim- its of the discrete dipole approximation and the T-matrix method, J. Quant. Spectrosc. Radiat. Transfer 123 (2013) 176–183.doi:10.1016/ j.jqsrt.2012.10.001

  45. [45]

    Yurkin, Erratum to ”discrete dipole approximation” [light, plasmon- ics and particles, ed

    M. Yurkin, Erratum to ”discrete dipole approximation” [light, plasmon- ics and particles, ed. menguc m.p. and francoeur m., (2025) 167–198 doi:10.13140/RG.2.2.18469.61920

  46. [46]

    J. D. Jackson, Classical Electrodynamics, 3rd Edition, Wiley, New York, 1998

  47. [47]

    B. T. Draine, P. J. Flatau, User guide for the discrete dipole approxi- mation code ddscat 7.3 (2013).arXiv:1305.6497

  48. [48]

    Lakhtakia, Strong and weak forms of the method of moments and the coupled dipole method for scattering of time-harmonic elec- tromagnetic fields, Int

    A. Lakhtakia, Strong and weak forms of the method of moments and the coupled dipole method for scattering of time-harmonic elec- tromagnetic fields, Int. J. Mod. Phys. C 3 (1992).doi:10.1142/ S0129183192000385

  49. [49]

    M. A. Yurkin, A. G. Hoekstra, User manual for the discrete dipole approximation code ADDA 1.4.0 (2020). URLhttps://github.com/adda-team/adda/raw/v1.4.0/doc/ manual.pdf

  50. [50]

    Strakoˇ s, J

    Z. Strakoˇ s, J. Liesen, On numerical stability in large scale linear alge- braic computations, ZAMM 85 (2005) 307–325.doi:10.1002/zamm. 200410185

  51. [51]

    Argentin, ddscatcli: v1.0.4, Zenodo (Feb

    C. Argentin, ddscatcli: v1.0.4, Zenodo (Feb. 2026).doi:10.5281/ zenodo.18619006. URLhttps://github.com/Argencle/ddscatcli 53

  52. [52]

    M. I. Mishchenko, Scale invariance rule in electromagnetic scattering, J. Quant. Spectrosc. Radiat. Transfer 101 (2006) 411–415.doi:10.1016/ j.jqsrt.2006.02.047

  53. [53]

    H. A. van der Vorst, Bi-cgstab: A fast and smoothly converging variant of bi-cg for the solution of nonsymmetric linear systems, SIAM J. Sci. Stat. Comput 13 (1992) 631–644.doi:10.1137/0913035

  54. [54]

    R. D. d. Cunha, T. Hopkins, The parallel iterative methods (pim) pack- age for the solution of systems of linear equations on parallel computers, Appl. Numer. Math. 19 (1995) 33–50.doi:10.1016/0168-9274(95) 00017-O

  55. [55]

    J. M. Donald, A. Golden, S. G. Jennings, Opendda: a novel high- performance computational framework for the discrete dipole approx- imation, Int. J. High Perform. Comput. Appl. 23 (2009) 42–61.doi: 10.1177/1094342008097

  56. [56]

    J. J. Goodman, B. T. Draine, P. J. Flatau, Application of fast-fourier- transform techniques to the discrete-dipole approximation, Opt. Lett. 16 (1991) 1198–1200.doi:10.1364/OL.16.001198

  57. [57]

    A. G. Hoekstra, M. D. Grimminck, P. M. A. Sloot, Large scale sim- ulations of elastic light scattering by a fast discrete dipole approx- imation, Int. J. Mod. Phys. C 09 (1998) 87–102.doi:10.1142/ S012918319800008X

  58. [58]

    Strack, C

    A. Strack, C. Taylor, D. Pfl¨ uger, Parallel fftw on risc-v: A comparative study including openmp, mpi, and hpx (2025).arXiv:2506.08653

  59. [59]

    M. A. Yurkin, D. A. Smunev, S. A. Glukhova, A. A. Kichigin, A. E. Moskalensky, K. G. Inzhevatkin, C. Argentin, P. Bouillon, Recent de- velopments of the ADDA code, in: The 21th Electromagnetic and Light Scattering Conference, University of Messina, Milazzo, Italy, 2025, p. 36. URLhttps://hal.science/hal-05142352

  60. [60]

    B. Li, S. Cheng, J. Lin, tcfft: Accelerating half-precision fft through tensor cores (2021).arXiv:2104.11471. 54

  61. [61]

    B. E. Barrowes, F. L. Teixeira, J. A. Kong, Fast algorithm for matrix– vector multiply of asymmetric multilevel block-toeplitz matrices in 3- d scattering, Microwave Opt. Technol. Lett. 31 (2001) 28–32.doi: 10.1002/mop.1348

  62. [62]

    Shabaninezhad, M

    M. Shabaninezhad, M. Awan, G. Ramakrishna, Matlab package for dis- crete dipole approximation by graphics processing unit: Fast fourier transform and biconjugate gradient, J. Quant. Spectrosc. Radiat. Trans- fer 262 (2021) 107501.doi:10.1016/j.jqsrt.2020.107501

  63. [63]

    Roofline: An insightful visual performance model for multicore architectures,

    S. Williams, A. Waterman, D. Patterson, Roofline: an insightful vi- sual performance model for multicore architectures, Commun. ACM 52 (2009) 65–76.doi:10.1145/1498765.149878

  64. [64]

    Porumbescu, and John D

    A. Ayala, S. Tomov, M. Stoyanov, A. Haidar, J. Dongarra, Performance analysis of parallel fft on large multi-gpu systems, in: 2022 IEEE In- ternational Parallel and Distributed Processing Symposium Workshops, IEEE, 2022, pp. 372–381.doi:10.1109/IPDPSW55747.2022.00072

  65. [65]

    Steinbach, M

    P. Steinbach, M. Werner, gearshifft – the fft benchmark suite for het- erogeneous platforms, in: High Performance Computing: 32nd Interna- tional Conference, Springer-Verlag, Berlin, Heidelberg, 2017, p. 199–216. doi:10.1007/978-3-319-58667-0_11

  66. [66]

    D. Xu, P. Tuersun, S. Li, M. Wang, L. Jiang, Cpdda: A python package for discrete dipole approximation accelerated by cupy, Nanomater. 15 (2025) 500.doi:10.3390/nano15070500

  67. [67]

    B. T. Draine, J. Goodman, Beyond clausius-mossotti-wave propagation on a polarizable point lattice and the discrete dipole approximation, Astrophys. J. 405 (1993) 685–697.doi:10.1086/172396

  68. [68]

    J. D. Fenton, R. S. Gardiner-Garden, Rapidly-convergent methods for evaluating elliptic integrals and theta and elliptic functions, J. Aust. Math. Soc. B 24 (1982) 47–58.doi:10.1017/S0334270000003301

  69. [69]

    Nijboer, F

    B. Nijboer, F. De Wette, The internal field in dipole lattices, Physica 24 (1958) 422–431.doi:10.1016/S0031-8914(58)95803-8

  70. [70]

    NIST Digital Library of Mathematical Functions, Section 8.4: Incom- plete gamma and related functions,https://dlmf.nist.gov/8.4, F. 55 W. J. Olver, A. B. Olde Daalhuis, D. W. Lozier, B. I. Schneider, R. F. Boisvert, C. W. Clark, B. R. Miller, B. V. Saunders, H. S. Cohl, and M. A. McClain, eds. Release 1.0.24 (2019-09-15)

  71. [71]

    D. W. Mackowski, Discrete dipole moment method for calculation of the t matrix for nonspherical particles, J. Opt. Soc. Am. A 19 (2002) 881–893.doi:10.1364/JOSAA.19.000881

  72. [72]

    Propagation of Electromagnetic Waves on a Rectangular Lattice of Polarizable Points

    D. Gutkowicz-Krusin, B. T. Draine, Propagation of electromagnetic waves on a rectangular lattice of polarizable points (2004).arXiv: astro-ph/0403082

  73. [73]

    Argentin, dda-bench: v0.1.1, Zenodo (Mar

    C. Argentin, dda-bench: v0.1.1, Zenodo (Mar. 2026).doi:https:// doi.org/10.5281/zenodo.18836855. URLhttps://github.com/Argencle/dda-bench

  74. [74]

    Evaluation of Higher- Order Quadrature Schemes in Improving Computational Efficiency for Orientation-Averaged Single-Scattering Properties of Nonspherical Ice Particles

    M. A. Yurkin, Fair Evaluation of Orientation-Averaging Techniques in Light-Scattering Simulations: Comment on “Evaluation of Higher- Order Quadrature Schemes in Improving Computational Efficiency for Orientation-Averaged Single-Scattering Properties of Nonspherical Ice Particles” by Fenni et al, J. Geophys. Res.: Atmos. 128 (2023).doi: 10.1029/2021JD036088

  75. [75]

    C. F. Bohren, D. R. Huffman, Absorption and scattering of light by small particles, John Wiley & Sons, 1983

  76. [76]

    P. C. Chaumet, M. Nieto-Vesperinas, Time-averaged total force on a dipolar sphere in an electromagnetic field, Opt. Lett. 25 (2000) 1065– 1067.doi:10.1364/OL.25.001065

  77. [77]

    B. T. Draine, J. C. Weingartner, Radiative Torques on Interstellar Grains. I. Superthermal Spin-up, Astrophys. J. 470 (1996) 551.doi: 10.1086/177887. 56