{"paper":{"title":"CMS Analysis and Data Reduction with Apache Spark","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"","cross_cats":[],"primary_cat":"cs.DC","authors_text":"(2) Fermi National Accelerator Laboratory, (3) Flatiron Institute of the Sions Foundation, (4) Intel Corp., (5) Princeton University, Alexey Svyatkovskiy (5) ((1) European Organization for Nuclear Research CERN, Batavia, Bo Jayatilaka (2), Evangelos Motesnitsalis (1), Geneva, Ian Fisk (3), IL, Illia Cremer (4), Jim Kowalkowski (2), Jim Pivarski (5), Kacper Surdy (1), Luca Canali (1), Maria Girone (1), Matteo Cremonesi (2), New York, NJ, NY, Oliver Gutsche (2), Peter Elmer (5), Princeton, Saba Sehrish (2), Switzerland, USA, USA), Viktor Khristenko (1)","submitted_at":"2017-10-31T16:25:40Z","abstract_excerpt":"Experimental Particle Physics has been at the forefront of analyzing the world's largest datasets for decades. The HEP community was among the first to develop suitable software and computing tools for this task. In recent times, new toolkits and systems for distributed data processing, collectively called \"Big Data\" technologies have emerged from industry and open source projects to support the analysis of Petabyte and Exabyte datasets in industry. While the principles of data analysis in HEP have not changed (filtering and transforming experiment-specific data formats), these new technologie"},"claims":{"count":0,"items":[],"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"source":{"id":"1711.00375","kind":"arxiv","version":1},"verdict":{"id":null,"model_set":{},"created_at":null,"strongest_claim":"","one_line_summary":"","pipeline_version":null,"weakest_assumption":"","pith_extraction_headline":""},"references":{"count":0,"sample":[],"resolved_work":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57","internal_anchors":0},"formal_canon":{"evidence_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"}