Atompack delivers 96x faster shuffled reads and 79% smaller artifacts than ASE LMDB baselines for complete-record atomistic ML training workloads.
Lemat-bulk: aggregating, and de-duplicating quantum chemistry materials databases, 2025
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Atompack: A Storage and Distribution Layer for Read-Heavy Atomistic ML Training Datasets
Atompack delivers 96x faster shuffled reads and 79% smaller artifacts than ASE LMDB baselines for complete-record atomistic ML training workloads.