Empirical study of agentic LLM generation of parallel Julia code finds reliable execution only at small scales with recurring failures in task dependencies and scheduling at larger scales.
Madgraph on GPUs and vector CPUs: Towards production. The 5- year journey to the first LO release CUDACPP v1.00.00
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
method 1polarities
use method 1representative citing papers
FPGA implementations for full matrix-element workflow on e+e- to mu+mu- and color-algebra kernels on gg to ttbar+X achieve speedups and energy gains over CPU/GPU while preserving numerical accuracy.
A cascade pipeline on 400 AIE tiles evaluates gg→ttg leading-order matrix elements at 1 million per second with parts-per-million accuracy to MadGraph, delivering 34× CPU speedup and 7.7× better energy efficiency at 54.8 W.
citing papers explorer
-
FPGA Acceleration of Matrix-Element Calculations for Monte Carlo Event Generation
FPGA implementations for full matrix-element workflow on e+e- to mu+mu- and color-algebra kernels on gg to ttbar+X achieve speedups and energy gains over CPU/GPU while preserving numerical accuracy.
-
Cascade Pipeline for Leading-Order Matrix Element Evaluation on AMD Versal AI Engine Arrays
A cascade pipeline on 400 AIE tiles evaluates gg→ttg leading-order matrix elements at 1 million per second with parts-per-million accuracy to MadGraph, delivering 34× CPU speedup and 7.7× better energy efficiency at 54.8 W.