pith. sign in

arxiv: 2107.00656 · v2 · pith:CIJDPXH2new · submitted 2021-07-01 · 💻 cs.LG · astro-ph.IM· hep-ph· nucl-th· physics.data-an· stat.ML

Shared Data and Algorithms for Deep Learning in Fundamental Physics

classification 💻 cs.LG astro-ph.IMhep-phnucl-thphysics.data-anstat.ML
keywords physicsdatasetsfundamentallearningalgorithmsdatagraph-basedhadronic
0
0 comments X
read the original abstract

We introduce a Python package that provides simply and unified access to a collection of datasets from fundamental physics research - including particle physics, astroparticle physics, and hadron- and nuclear physics - for supervised machine learning studies. The datasets contain hadronic top quarks, cosmic-ray induced air showers, phase transitions in hadronic matter, and generator-level histories. While public datasets from multiple fundamental physics disciplines already exist, the common interface and provided reference models simplify future work on cross-disciplinary machine learning and transfer learning in fundamental physics. We discuss the design and structure and line out how additional datasets can be submitted for inclusion. As showcase application, we present a simple yet flexible graph-based neural network architecture that can easily be applied to a wide range of supervised learning tasks. We show that our approach reaches performance close to dedicated methods on all datasets. To simplify adaptation for various problems, we provide easy-to-follow instructions on how graph-based representations of data structures, relevant for fundamental physics, can be constructed and provide code implementations for several of them. Implementations are also provided for our proposed method and all reference algorithms.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Local Conformal Predictions for Calibrated Surrogates

    hep-ph 2026-07 unverdicted novelty 7.0

    FALCON is a novel conformal prediction technique that learns locally calibrated confidence intervals for neural network surrogates modeling LHC scattering amplitudes.