pith. sign in

arxiv: 2604.12604 · v2 · submitted 2026-04-14 · ⚛️ physics.optics

Open dataset for benchmarking scaling laws of high-energy laser atmospheric propagation

Pith reviewed 2026-05-10 15:01 UTC · model grok-4.3

classification ⚛️ physics.optics
keywords high-energy laseratmospheric propagationturbulencethermal bloomingscaling lawsopen datasetwave-optics simulationbenchmark
0
0 comments X

The pith

A public dataset of 226,500 high-energy laser simulations enables reproducible scaling-law calibration.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper releases an open collection of 226,500 wave-optics simulations of high-energy laser beams propagating through the atmosphere under coupled turbulence and thermal blooming. The cases span target speed, emission geometry, aperture diameter, visibility, aerosol model, beam quality, turbulence strength, and laser power. Data are organized as a main parameter table linked to indexed long-exposure irradiance arrays plus centralized metadata, allowing statistical analysis while preserving access to the underlying field outputs. The release replaces private simulation collections with a shared benchmark so that scaling-law surrogates can be calibrated, compared, and improved on identical reference runs generated by a split-step pipeline whose component models were checked against established propagation references.

Core claim

The central claim is the creation and public release of a dataset containing 226,500 simulation cases of high-energy laser atmospheric propagation that include coupled turbulence and thermal blooming. Each case is produced by a split-step wave-optics pipeline whose turbulence, attenuation, and thermal-blooming modules have been validated against reference propagations. The data structure consists of a case-level main table connected to indexed long-exposure irradiance arrays and supporting metadata, supporting direct use for scaling-law calibration, surrogate training, sensitivity analysis, and inverse studies without requiring regeneration of the high-fidelity fields.

What carries the argument

The open dataset of 226,500 propagation cases organized as a parameter table linked to irradiance arrays, generated by a validated split-step wave-optics pipeline that couples turbulence, attenuation, and thermal-blooming effects.

If this is right

  • Scaling-law formulations can be calibrated and directly compared using a common public reference set instead of private collections.
  • Surrogate models can be trained and tested on the provided irradiance arrays and metadata for a wide range of operating conditions.
  • Sensitivity of beam quality to individual parameters such as turbulence strength or aerosol type can be quantified statistically across the sampled space.
  • Inverse studies that infer atmospheric or system parameters from observed irradiance patterns become feasible with the large number of cases.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The shared structure could reduce redundant generation of simulation data across different research groups working on laser propagation.
  • The dataset format supports community extensions such as additional propagation effects if the underlying pipeline remains reproducible.
  • Meta-analyses comparing multiple surrogate models on identical inputs become practical for the first time.

Load-bearing premise

The split-step wave-optics models for turbulence, attenuation, and thermal blooming produce outputs accurate enough across the full parameter space to serve as reliable reference data for scaling-law calibration.

What would settle it

An independent wave-optics code run on a representative subset of the input parameters produces long-exposure irradiance statistics that differ systematically from the dataset outputs beyond the tolerance needed for surrogate calibration.

read the original abstract

Scaling laws are increasingly used as fast surrogate models for high energy laser atmospheric propagation, yet their calibration and comparison still depend on large collections of high-fidelity wave-optics simulations. Existing studies usually rely on privately organized simulation outputs, which makes it difficult to reproduce published fits or evaluate new surrogate formulations on a shared benchmark. We present a public simulation dataset for high energy laser atmospheric propagation with coupled turbulence and thermal blooming. The release contains 226,500 cases spanning target speed, emission geometry, aperture diameter, visibility, aerosol model, beam quality, turbulence strength, and laser power. Data are organized as a case-level main table linked to indexed long-exposure irradiance arrays and centralized metadata, which supports statistical analysis without hiding the underlying field outputs. The simulation pipeline is based on split-step wave-optics propagation with turbulence, attenuation, and thermal-blooming models that have been validated against established propagation references. The dataset is intended for scaling-law calibration, benchmark comparison, surrogate-model training, sensitivity analysis, and inverse studies.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The manuscript presents a public dataset of 226,500 split-step wave-optics simulations of high-energy laser atmospheric propagation that include coupled turbulence and thermal blooming effects. The cases span parameters such as target speed, emission geometry, aperture diameter, visibility, aerosol model, beam quality, turbulence strength, and laser power. Data are released in an organized format consisting of a case-level main table, indexed long-exposure irradiance arrays, and centralized metadata, with the explicit goal of supporting scaling-law calibration, surrogate-model training, benchmark comparisons, sensitivity analysis, and inverse studies. The underlying propagation models are stated to have been validated against established references.

Significance. If the simulation outputs are sufficiently accurate, the release constitutes a valuable open benchmark that directly addresses the reproducibility barrier created by privately held simulation collections in this field. The scale (226,500 cases), breadth of parameter coverage, and provision of both summary tables and raw irradiance fields enable statistical analyses and machine-learning surrogate development that were previously difficult to perform on shared data. This is a concrete contribution to the infrastructure needed for scaling-law research.

major comments (1)
  1. [Simulation pipeline description] Simulation pipeline description: The manuscript asserts that the turbulence, attenuation, and thermal-blooming models have been validated against established propagation references, yet supplies no quantitative error metrics (e.g., RMS irradiance deviation, Strehl-ratio bias, or beam-width error) evaluated at representative points across the spanned ranges of turbulence strength, laser power, visibility, and aerosol model. Without such metrics, users cannot readily assess whether regime-specific discrepancies could systematically bias derived scaling relations.
minor comments (1)
  1. [Data release and organization] Data organization: The description of the case-level main table, indexed irradiance arrays, and metadata is clear, but a short explicit example of how a user would load and combine the files for a statistical analysis would improve immediate usability.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their positive evaluation of the dataset's significance and for the constructive comment regarding the simulation pipeline. We address the point below.

read point-by-point responses
  1. Referee: [Simulation pipeline description] Simulation pipeline description: The manuscript asserts that the turbulence, attenuation, and thermal-blooming models have been validated against established propagation references, yet supplies no quantitative error metrics (e.g., RMS irradiance deviation, Strehl-ratio bias, or beam-width error) evaluated at representative points across the spanned ranges of turbulence strength, laser power, visibility, and aerosol model. Without such metrics, users cannot readily assess whether regime-specific discrepancies could systematically bias derived scaling relations.

    Authors: We agree that the absence of quantitative error metrics limits users' ability to evaluate potential regime-specific biases. The turbulence, attenuation, and thermal-blooming models are taken from our prior validated implementations, where comparisons to established references (split-step Fourier benchmarks and analytic limits) were reported with metrics such as Strehl-ratio error and beam-width deviation. These earlier validations, however, were not reproduced or extended across the full parameter space of the present dataset. To remedy this, the revised manuscript will include a new subsection summarizing representative quantitative metrics (RMS irradiance deviation, Strehl-ratio bias, and beam-width error) evaluated at selected points spanning the ranges of Cn^{2}, laser power, visibility, and aerosol models. These metrics will be generated from additional runs using the identical pipeline, enabling users to assess fidelity for scaling-law applications. revision: yes

Circularity Check

0 steps flagged

No circularity: dataset release is self-contained with no derived predictions or self-referential fits.

full rationale

The manuscript releases a fixed collection of 226,500 pre-computed split-step wave-optics cases and organizes them for external use in scaling-law calibration. No scaling law, surrogate model, or fitted parameter is derived from these outputs inside the paper. The only load-bearing statement is that the underlying turbulence, attenuation, and thermal-blooming models “have been validated against established propagation references,” which is an external assertion rather than a derivation performed on the released data. Because the paper performs no fitting, prediction, or self-referential reduction of any claimed result to its own inputs, the derivation chain contains no circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The paper rests on standard wave-optics propagation methods and previously validated turbulence and thermal-blooming models. No new free parameters, axioms, or invented entities are introduced by the dataset release itself.

axioms (1)
  • domain assumption Split-step Fourier propagation accurately models linear and nonlinear atmospheric effects when combined with established turbulence and thermal-blooming sub-models.
    Invoked in the abstract when stating the simulation pipeline.

pith-pipeline@v0.9.0 · 5470 in / 1309 out tokens · 38013 ms · 2026-05-10T15:01:26.847550+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

1 extracted references · 1 canonical work pages

  1. [1]

    Self-induced thermal distortion in the near field for a laser beam in a moving medium,

    1 F. Gebhardt and D. Smith, “Self-induced thermal distortion in the near field for a laser beam in a moving medium,”IEEE Journal of Quantum Electronics7, 63–73 (1971). 2 Frederick G. Gebhardt, “High power laser propagation,”Applied Optics15, 1479 (1976). 3 D.C. Smith, “High-power laser propagation: Thermal blooming,”Proceedings of the IEEE65(12), 1679–171...