Open dataset for benchmarking scaling laws of high-energy laser atmospheric propagation
Pith reviewed 2026-05-10 15:01 UTC · model grok-4.3
The pith
A public dataset of 226,500 high-energy laser simulations enables reproducible scaling-law calibration.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is the creation and public release of a dataset containing 226,500 simulation cases of high-energy laser atmospheric propagation that include coupled turbulence and thermal blooming. Each case is produced by a split-step wave-optics pipeline whose turbulence, attenuation, and thermal-blooming modules have been validated against reference propagations. The data structure consists of a case-level main table connected to indexed long-exposure irradiance arrays and supporting metadata, supporting direct use for scaling-law calibration, surrogate training, sensitivity analysis, and inverse studies without requiring regeneration of the high-fidelity fields.
What carries the argument
The open dataset of 226,500 propagation cases organized as a parameter table linked to irradiance arrays, generated by a validated split-step wave-optics pipeline that couples turbulence, attenuation, and thermal-blooming effects.
If this is right
- Scaling-law formulations can be calibrated and directly compared using a common public reference set instead of private collections.
- Surrogate models can be trained and tested on the provided irradiance arrays and metadata for a wide range of operating conditions.
- Sensitivity of beam quality to individual parameters such as turbulence strength or aerosol type can be quantified statistically across the sampled space.
- Inverse studies that infer atmospheric or system parameters from observed irradiance patterns become feasible with the large number of cases.
Where Pith is reading between the lines
- The shared structure could reduce redundant generation of simulation data across different research groups working on laser propagation.
- The dataset format supports community extensions such as additional propagation effects if the underlying pipeline remains reproducible.
- Meta-analyses comparing multiple surrogate models on identical inputs become practical for the first time.
Load-bearing premise
The split-step wave-optics models for turbulence, attenuation, and thermal blooming produce outputs accurate enough across the full parameter space to serve as reliable reference data for scaling-law calibration.
What would settle it
An independent wave-optics code run on a representative subset of the input parameters produces long-exposure irradiance statistics that differ systematically from the dataset outputs beyond the tolerance needed for surrogate calibration.
read the original abstract
Scaling laws are increasingly used as fast surrogate models for high energy laser atmospheric propagation, yet their calibration and comparison still depend on large collections of high-fidelity wave-optics simulations. Existing studies usually rely on privately organized simulation outputs, which makes it difficult to reproduce published fits or evaluate new surrogate formulations on a shared benchmark. We present a public simulation dataset for high energy laser atmospheric propagation with coupled turbulence and thermal blooming. The release contains 226,500 cases spanning target speed, emission geometry, aperture diameter, visibility, aerosol model, beam quality, turbulence strength, and laser power. Data are organized as a case-level main table linked to indexed long-exposure irradiance arrays and centralized metadata, which supports statistical analysis without hiding the underlying field outputs. The simulation pipeline is based on split-step wave-optics propagation with turbulence, attenuation, and thermal-blooming models that have been validated against established propagation references. The dataset is intended for scaling-law calibration, benchmark comparison, surrogate-model training, sensitivity analysis, and inverse studies.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a public dataset of 226,500 split-step wave-optics simulations of high-energy laser atmospheric propagation that include coupled turbulence and thermal blooming effects. The cases span parameters such as target speed, emission geometry, aperture diameter, visibility, aerosol model, beam quality, turbulence strength, and laser power. Data are released in an organized format consisting of a case-level main table, indexed long-exposure irradiance arrays, and centralized metadata, with the explicit goal of supporting scaling-law calibration, surrogate-model training, benchmark comparisons, sensitivity analysis, and inverse studies. The underlying propagation models are stated to have been validated against established references.
Significance. If the simulation outputs are sufficiently accurate, the release constitutes a valuable open benchmark that directly addresses the reproducibility barrier created by privately held simulation collections in this field. The scale (226,500 cases), breadth of parameter coverage, and provision of both summary tables and raw irradiance fields enable statistical analyses and machine-learning surrogate development that were previously difficult to perform on shared data. This is a concrete contribution to the infrastructure needed for scaling-law research.
major comments (1)
- [Simulation pipeline description] Simulation pipeline description: The manuscript asserts that the turbulence, attenuation, and thermal-blooming models have been validated against established propagation references, yet supplies no quantitative error metrics (e.g., RMS irradiance deviation, Strehl-ratio bias, or beam-width error) evaluated at representative points across the spanned ranges of turbulence strength, laser power, visibility, and aerosol model. Without such metrics, users cannot readily assess whether regime-specific discrepancies could systematically bias derived scaling relations.
minor comments (1)
- [Data release and organization] Data organization: The description of the case-level main table, indexed irradiance arrays, and metadata is clear, but a short explicit example of how a user would load and combine the files for a statistical analysis would improve immediate usability.
Simulated Author's Rebuttal
We thank the referee for their positive evaluation of the dataset's significance and for the constructive comment regarding the simulation pipeline. We address the point below.
read point-by-point responses
-
Referee: [Simulation pipeline description] Simulation pipeline description: The manuscript asserts that the turbulence, attenuation, and thermal-blooming models have been validated against established propagation references, yet supplies no quantitative error metrics (e.g., RMS irradiance deviation, Strehl-ratio bias, or beam-width error) evaluated at representative points across the spanned ranges of turbulence strength, laser power, visibility, and aerosol model. Without such metrics, users cannot readily assess whether regime-specific discrepancies could systematically bias derived scaling relations.
Authors: We agree that the absence of quantitative error metrics limits users' ability to evaluate potential regime-specific biases. The turbulence, attenuation, and thermal-blooming models are taken from our prior validated implementations, where comparisons to established references (split-step Fourier benchmarks and analytic limits) were reported with metrics such as Strehl-ratio error and beam-width deviation. These earlier validations, however, were not reproduced or extended across the full parameter space of the present dataset. To remedy this, the revised manuscript will include a new subsection summarizing representative quantitative metrics (RMS irradiance deviation, Strehl-ratio bias, and beam-width error) evaluated at selected points spanning the ranges of Cn^{2}, laser power, visibility, and aerosol models. These metrics will be generated from additional runs using the identical pipeline, enabling users to assess fidelity for scaling-law applications. revision: yes
Circularity Check
No circularity: dataset release is self-contained with no derived predictions or self-referential fits.
full rationale
The manuscript releases a fixed collection of 226,500 pre-computed split-step wave-optics cases and organizes them for external use in scaling-law calibration. No scaling law, surrogate model, or fitted parameter is derived from these outputs inside the paper. The only load-bearing statement is that the underlying turbulence, attenuation, and thermal-blooming models “have been validated against established propagation references,” which is an external assertion rather than a derivation performed on the released data. Because the paper performs no fitting, prediction, or self-referential reduction of any claimed result to its own inputs, the derivation chain contains no circular steps.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Split-step Fourier propagation accurately models linear and nonlinear atmospheric effects when combined with established turbulence and thermal-blooming sub-models.
Reference graph
Works this paper leans on
-
[1]
Self-induced thermal distortion in the near field for a laser beam in a moving medium,
1 F. Gebhardt and D. Smith, “Self-induced thermal distortion in the near field for a laser beam in a moving medium,”IEEE Journal of Quantum Electronics7, 63–73 (1971). 2 Frederick G. Gebhardt, “High power laser propagation,”Applied Optics15, 1479 (1976). 3 D.C. Smith, “High-power laser propagation: Thermal blooming,”Proceedings of the IEEE65(12), 1679–171...
work page 1971
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.