pith. sign in

JUNIPR: a Framework for Unsupervised Machine Learning in Particle Physics

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it
abstract

In applications of machine learning to particle physics, a persistent challenge is how to go beyond discrimination to learn about the underlying physics. To this end, a powerful tool would be a framework for unsupervised learning, where the machine learns the intricate high-dimensional contours of the data upon which it is trained, without reference to pre-established labels. In order to approach such a complex task, an unsupervised network must be structured intelligently, based on a qualitative understanding of the data. In this paper, we scaffold the neural network's architecture around a leading-order model of the physics underlying the data. In addition to making unsupervised learning tractable, this design actually alleviates existing tensions between performance and interpretability. We call the framework JUNIPR: "Jets from UNsupervised Interpretable PRobabilistic models". In this approach, the set of particle momenta composing a jet are clustered into a binary tree that the neural network examines sequentially. Training is unsupervised and unrestricted: the network could decide that the data bears little correspondence to the chosen tree structure. However, when there is a correspondence, the network's output along the tree has a direct physical interpretation. JUNIPR models can perform discrimination tasks, through the statistically optimal likelihood-ratio test, and they permit visualizations of discrimination power at each branching in a jet's tree. Additionally, JUNIPR models provide a probability distribution from which events can be drawn, providing a data-driven Monte Carlo generator. As a third application, JUNIPR models can reweight events from one (e.g. simulated) data set to agree with distributions from another (e.g. experimental) data set.

fields

hep-ph 2

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

Neural Scaling Laws for Jet Generation

hep-ph · 2026-05-27 · unverdicted · novelty 7.0

Scaling laws hold logarithmically for model size in autoregressive jet generation, with next-token loss correlating to physical metrics via sliced Wasserstein distance, but show weaker scaling for dataset size and compute due to rapid saturation.

citing papers explorer

Showing 2 of 2 citing papers.

  • Neural Scaling Laws for Jet Generation hep-ph · 2026-05-27 · unverdicted · none · ref 38 · internal anchor

    Scaling laws hold logarithmically for model size in autoregressive jet generation, with next-token loss correlating to physical metrics via sliced Wasserstein distance, but show weaker scaling for dataset size and compute due to rapid saturation.

  • Stress testing of fast reconstruction pipelines using machine learning hep-ph · 2026-06-16 · unverdicted · none · ref 5 · internal anchor

    Context-aware stress testing reveals that the local assumption fails for Z→ℓℓ reconstruction at HL-LHC, producing bias and degraded resolution that an unsupervised regime-mapping framework then corrects.