A universal vision transformer for fast calorimeter simulations

Andrea Giammanco; Claudius Krause; Luigi Favaro

arxiv: 2601.05289 · v2 · pith:IDA6ZYH6new · submitted 2026-01-07 · ✦ hep-ph · cs.LG· hep-ex· physics.ins-det

A universal vision transformer for fast calorimeter simulations

Luigi Favaro , Andrea Giammanco , Claudius Krause This is my paper

classification ✦ hep-ph cs.LGhep-exphysics.ins-det

keywords vitscalorimeterdetectorsfastgeant4geometriesmultipleregular

0 comments

read the original abstract

The high-dimensional complex nature of detectors makes fast calorimeter simulations a prime application for modern generative machine learning. Vision transformers (ViTs) can emulate the Geant4 response with unmatched accuracy and are not limited to regular geometries. Starting from the CaloDREAM architecture, we demonstrate the robustness and scalability of ViTs on regular and irregular geometries, and multiple detectors. Our results show that ViTs generate electromagnetic and hadronic showers with minimal deviations from Geant4 in multiple evaluation metrics, while maintaining the generation time in the $\mathcal{O}(10-100)$ ms on a single GPU. Furthermore, we show that pretraining on a large dataset and fine-tuning on the target geometry leads to reduced training costs and higher data efficiency, or altogether improves the fidelity of generated showers.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CaloArt: Large-Patch x-Prediction Diffusion Transformers for High-Granularity Calorimeter Shower Generation
physics.ins-det 2026-05 unverdicted novelty 5.0

CaloArt achieves top FPD, high-level, and classifier metrics on CaloChallenge datasets 2 and 3 while keeping single-GPU generation at 9-11 ms per shower by combining large-patch tokenization, x-prediction, and conditi...