Using Binary File Format Description Languages for Documenting, Parsing, and Verifying Raw Data in TAIGA Experiment
The paper is devoted to documenting, parsing, and verifying raw binary data in the astroparticle data lifecycle. Long-term preservation of the raw data of astroparticle experiments, as originally generated, is essential for re-running analyses and reproducing research results. Selected high-quality raw data should have detailed documentation and be accompanied by open software tools for accessing them. We consider the applicability of binary file format description languages to specifying, parsing, and verifying raw data of the Tunka Advanced Instrument for cosmic rays and Gamma Astronomy (TAIGA) experiment. Formal specifications are implemented for five data formats of the experiment and enable automatic generation of source code for data-reading libraries in target programming languages (e.g., C++, Java, and Python). These libraries were tested on TAIGA data. They showed good performance and helped us locate the parts with corrupted data. The format specifications can be used as metadata for exchanging astroparticle raw data. They can also simplify software development for aggregating data from various sources for multi-messenger analysis.
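To make the idea of parsing and verifying a described binary format concrete, here is a minimal Python sketch. It is not code from the paper and not a real TAIGA format: the record layout (a 2-byte magic marker, a 32-bit event number, a 16-bit payload size, then the payload), the names `HEADER`, `MAGIC`, `Record`, `make_record`, and `parse_records` are all assumptions chosen for illustration. The reader is hand-written with the standard `struct` module, roughly the kind of code a format description language might generate from a declarative specification, and it reports the byte offsets where the data do not match the layout.

```python
import struct
from dataclasses import dataclass

# Hypothetical record layout (an assumption, NOT a real TAIGA format):
#   magic        2 bytes, fixed marker 0xA5 0x5A
#   event_number uint32, little-endian
#   payload_size uint16, little-endian
#   payload      payload_size bytes
HEADER = struct.Struct("<2sIH")
MAGIC = b"\xa5\x5a"


@dataclass
class Record:
    event_number: int
    payload: bytes


def make_record(event_number: int, payload: bytes) -> bytes:
    """Serialize one record in the hypothetical layout (used for testing)."""
    return HEADER.pack(MAGIC, event_number, len(payload)) + payload


def parse_records(buf: bytes):
    """Parse all records in buf.

    Returns (records, errors), where errors is a list of
    (byte_offset, message) pairs marking data that violate the layout.
    """
    records, errors = [], []
    pos = 0
    while pos < len(buf):
        if len(buf) - pos < HEADER.size:
            errors.append((pos, "truncated header"))
            break
        magic, event_number, size = HEADER.unpack_from(buf, pos)
        if magic != MAGIC:
            # Record the corrupted position, then resynchronize
            # on the next occurrence of the magic marker.
            errors.append((pos, "bad magic"))
            nxt = buf.find(MAGIC, pos + 1)
            if nxt == -1:
                break
            pos = nxt
            continue
        start = pos + HEADER.size
        end = start + size
        if end > len(buf):
            errors.append((pos, "truncated payload"))
            break
        records.append(Record(event_number, buf[start:end]))
        pos = end
    return records, errors


if __name__ == "__main__":
    # Two valid records with three junk bytes between them.
    data = make_record(1, b"abc") + b"\x00\x00\x00" + make_record(2, b"xy")
    recs, errs = parse_records(data)
    print([r.event_number for r in recs], errs)
```

Resynchronizing on the magic marker is one simple design choice for skipping damaged regions; a real format may need stronger checks (for example checksums or timestamp consistency) to avoid false matches inside payload bytes.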
Forward citations
Cited by 2 Pith papers
- Development of a data infrastructure for a global data and analysis center in astroparticle physics
  GRADLCI develops a distributed data management system for open access to KASCADE and Tunka-133 cosmic-ray data to support joint analysis.
- Metadata Extraction from Raw Astroparticle Data of TAIGA Experiment
  An extensible metadata extractor concept is presented to automatically collect and unify descriptive metadata from all TAIGA raw data formats.