Hybrid deep learning-based phase diversity method for wavefront reconstruction

A. Kotov; A. Soloviev; I. Meyerov; K. Burdonov; S. Perevalov; V. Volokitin; Y. Rodimkov

arxiv: 2606.25855 · v1 · pith:XWVKVZDTnew · submitted 2026-06-24 · ⚛️ physics.optics · cs.CV· physics.app-ph

Hybrid deep learning-based phase diversity method for wavefront reconstruction

Y. Rodimkov , A. Kotov , K. Burdonov , S. Perevalov , V. Volokitin , I. Meyerov , A. Soloviev This is my paper

Pith reviewed 2026-06-25 20:03 UTC · model grok-4.3

classification ⚛️ physics.optics cs.CVphysics.app-ph

keywords wavefront reconstructionphase diversityhybrid deep learningadaptive optics calibrationL-BFGS optimizationconvolutional neural networklaser beam qualityStrehl ratio

0 comments

The pith

A hybrid CNN initial guess followed by L-BFGS refinement reconstructs wavefront distortions to high efficiency in both simulation and experiment.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to show that pairing a convolutional neural network for a fast initial wavefront estimate with the L-BFGS optimizer for refinement overcomes the speed-accuracy trade-off in phase-diversity calibration of adaptive optics. Conventional iterative methods are slow and sensitive to starting points, while pure deep-learning models lack final accuracy. The hybrid approach is tested across a range of root-mean-square distortions in numerical cases and in a physical laser setup, producing concrete efficiency and Strehl-ratio numbers after only a few optimization steps. A sympathetic reader would care because successful fast calibration directly raises the peak intensity delivered by high-power laser systems.

Core claim

The hybrid method combines a convolutional neural network that supplies an initial wavefront estimate with the L-BFGS algorithm that refines it; numerical trials reach an efficiency of approximately 0.99 in 80 percent of cases for RMS distortions from 0 to 1.3 lambda, while physical experiments on distortions of 0.15 to 0.6 lambda yield an efficiency near 0.75 and a Strehl ratio of 0.96 plus or minus 0.02 after two to four L-BFGS iterations.

What carries the argument

The hybrid pipeline in which the CNN produces the starting wavefront map and L-BFGS performs the subsequent phase-diversity minimization.

If this is right

Numerical performance holds across RMS distortions up to 1.3 lambda with efficiency near 0.99 in most trials.
Physical tests confirm the method works for RMS values 0.15-0.6 lambda and produces Strehl ratios of 0.96 plus or minus 0.02.
Calibration of adaptive optics finishes in 2-4 iterations under the reported conditions.
The approach is positioned as suitable for real-time or near-real-time calibration of high-power laser systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the CNN training set is expanded to cover larger distortion ranges, the same hybrid structure could extend the reliable RMS interval beyond 1.3 lambda.
Replacing L-BFGS with a different local optimizer might change the number of iterations needed once the CNN seed is fixed.
The reported physical efficiency of 0.75 suggests the method may still benefit from further CNN accuracy gains before it matches simulation results.
Success on non-common-path aberrations implies the pipeline could be tested on other phase-retrieval tasks that currently rely on pure optimization.

Load-bearing premise

The CNN's first estimate lies close enough to the true wavefront that L-BFGS reaches a high-efficiency solution in only a few iterations without becoming trapped in poor local minima.

What would settle it

A set of test wavefronts where the hybrid method consistently requires more than four L-BFGS iterations or ends with efficiency below 0.7 even when the CNN is trained on the same distribution of distortions.

Figures

Figures reproduced from arXiv: 2606.25855 by A. Kotov, A. Soloviev, I. Meyerov, K. Burdonov, S. Perevalov, V. Volokitin, Y. Rodimkov.

**Figure 3.** Figure 3: The final dataset consisted of training set of 100,000 [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗

**Figure 2.** Figure 2: FIG. 2: Maximum and minimum bounds for the Zernike [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 4.** Figure 4: FIG. 4: Architecture of the modified U-Net applied for the [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗

**Figure 5.** Figure 5: FIG. 5: Schematic of the workflow of the hybrid method. [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗

**Figure 6.** Figure 6: FIG. 6: Dependence of algorithm efficiency on distortion [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 7.** Figure 7: FIG. 7: Schematic of the deformable mirror and the [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗

**Figure 8.** Figure 8: FIG. 8: Schematic of the experimental setup. DM: deformable mirror; OAP: off-axis parabolic mirror; MO: microscope [PITH_FULL_IMAGE:figures/full_fig_p009_8.png] view at source ↗

**Figure 9.** Figure 9: FIG. 9: Experimental results of the iterative wavefront correction based on the predictions of the hybrid method. The left axis [PITH_FULL_IMAGE:figures/full_fig_p010_9.png] view at source ↗

**Figure 10.** Figure 10: FIG. 10: Efficiency of the hybrid method on experimental [PITH_FULL_IMAGE:figures/full_fig_p011_10.png] view at source ↗

read the original abstract

The efficiency of high-power laser systems is limited by wavefront distortions in the beam, particularly non-common path aberrations, which reduce the peak intensity at the focal plane. Compensating for these aberrations requires the calibration of the adaptive optics system. Conventional calibration methods rely on a time-consuming iterative optimization that is highly sensitive to initial conditions. While deep learning-based models offer high speed, they often demonstrate insufficient accuracy. In this work, we present a hybrid wavefront reconstruction method that combines a convolutional neural network to generate an initial estimate of the wavefront distortions, with the L-BFGS (Limited-memory Broyden-Fletcher-Goldfarb-Shanno) algorithm for its subsequent refinement. In numerical simulations, the method achieved an efficiency of $\sim 0.99$ in 80% of the cases for a root-mean-square (RMS) of wavefront distortions ranging from 0 to $1.3\lambda$. In a physical experiment, for initial wavefront distortions with RMS values from 0.15 to $0.6\lambda$, the method achieved an efficiency of $\sim 0.75$. As a result, focusing with a Strehl ratio of $0.96 \pm 0.02$ was attained within 2 to 4 iterations of the algorithm, confirming the applicability of the method for the fast and accurate calibration of adaptive optics systems under real experimental conditions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The hybrid CNN plus L-BFGS method reaches usable experimental Strehl ratios but the reported gains rest on an untested assumption about the quality of the CNN starting point.

read the letter

The main takeaway is that this paper pairs a convolutional neural network with L-BFGS to handle phase-diversity wavefront sensing for adaptive-optics calibration in high-power lasers. In simulations it claims ~0.99 efficiency in 80% of cases up to 1.3λ RMS; in a physical run on 0.15–0.6λ RMS distortions it reaches ~0.75 efficiency and Strehl 0.96±0.02 after only 2–4 iterations.

The experimental result is the clearest positive. Real hardware data with a reported Strehl ratio is worth having in this subfield, and the hybrid idea itself is presented as new for this calibration task.

The soft spots are straightforward. The abstract gives no definition of the efficiency metric, no error bars on it, no network architecture or training details, and no baseline runs of L-BFGS from a flat or random start. Without those, it is impossible to tell how much the CNN actually contributes versus the optimizer. The 20% of simulations that did not reach 0.99 are not analyzed, and there is no check on whether the CNN residual stays inside the capture radius of the L-BFGS objective for the aberration statistics they used. The experimental range is also narrow. These gaps make the robustness claim hard to judge from what is shown.

The work is aimed at researchers who calibrate adaptive optics on high-power laser systems. Someone in that niche can extract the experimental numbers and decide whether the speed-up is useful for their setup. It is coherent enough on its own terms to deserve a full referee, even though the current version would need added ablations and clearer metrics before acceptance.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a hybrid wavefront reconstruction method for adaptive optics calibration that uses a convolutional neural network to generate an initial estimate of wavefront distortions followed by refinement with the L-BFGS algorithm in a phase-diversity framework. It reports that numerical simulations achieve an efficiency of ~0.99 in 80% of cases for RMS distortions from 0 to 1.3λ, while physical experiments for RMS values 0.15–0.6λ reach ~0.75 efficiency and Strehl ratio 0.96±0.02 within 2–4 L-BFGS iterations.

Significance. If the performance claims hold after addressing the gaps in validation, the hybrid method could provide a practical balance of speed and accuracy for calibrating adaptive optics in high-power laser systems, reducing the iteration count compared to pure optimization while improving on pure deep-learning accuracy. The reported experimental Strehl ratio is a concrete, falsifiable outcome that would strengthen the case for real-world applicability if supported by proper controls.

major comments (2)

[Abstract] Abstract: the central performance claims (~0.99 efficiency in 80% of simulations; ~0.75 efficiency and Strehl 0.96±0.02 in experiment) rest on the untested premise that the CNN initial guess always lies inside the L-BFGS basin of attraction; no ablation (L-BFGS from zero/random phases on the same test set), no quantification of CNN residual error versus capture radius, and no failure-case analysis of the remaining 20% of simulations are described.
[Abstract] Abstract: efficiency is reported without definition, error bars, baseline comparisons (standalone L-BFGS, other DL methods, or phase-diversity variants), or details on training data and network architecture, preventing assessment of whether the hybrid combination actually improves upon either component alone.

minor comments (1)

[Abstract] Abstract: the RMS ranges are given without explicit units or wavelength reference in the experimental section, and the phrase 'within 2 to 4 iterations' should specify whether this is median, mean, or worst-case.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on validation gaps in our hybrid wavefront reconstruction method. We address each major comment below and will revise the manuscript to strengthen the supporting evidence for the reported performance.

read point-by-point responses

Referee: [Abstract] Abstract: the central performance claims (~0.99 efficiency in 80% of simulations; ~0.75 efficiency and Strehl 0.96±0.02 in experiment) rest on the untested premise that the CNN initial guess always lies inside the L-BFGS basin of attraction; no ablation (L-BFGS from zero/random phases on the same test set), no quantification of CNN residual error versus capture radius, and no failure-case analysis of the remaining 20% of simulations are described.

Authors: We agree that these validation elements are absent from the manuscript and that their inclusion would better substantiate the hybrid method's reliance on the CNN initialization. The current work does not contain the requested ablations, residual error quantification, or failure-case analysis. We will add an ablation study of L-BFGS started from random or zero phases on the same simulated test set, quantify CNN residual wavefront error relative to the optimizer's capture radius, and analyze the 20% of cases where efficiency falls below the reported threshold. These additions will be incorporated into the results and discussion sections of the revised manuscript. revision: yes
Referee: [Abstract] Abstract: efficiency is reported without definition, error bars, baseline comparisons (standalone L-BFGS, other DL methods, or phase-diversity variants), or details on training data and network architecture, preventing assessment of whether the hybrid combination actually improves upon either component alone.

Authors: We agree that the abstract (and, by extension, the level of detail provided) does not define efficiency, include error bars, present explicit baseline comparisons, or summarize training data and architecture. We will revise the abstract to define efficiency as the ratio of achieved focal-plane peak intensity to the diffraction-limited value, report associated uncertainties, add direct comparisons against standalone L-BFGS and pure CNN baselines on the same test sets, and include concise descriptions of the training dataset (Zernike-based simulated aberrations) and network architecture. These changes will allow readers to evaluate the hybrid improvement directly. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical performance metrics from simulations and experiments

full rationale

The paper presents a hybrid CNN-initialized L-BFGS method for wavefront reconstruction and directly reports measured efficiencies (~0.99 in 80% of sim cases; ~0.75 and Strehl 0.96±0.02 in physical tests) as experimental outcomes. No derivation chain, equations, or fitted parameters are shown that reduce by construction to inputs, self-citations, or ansatzes. The central claims rest on empirical validation rather than any self-referential prediction or uniqueness theorem imported from prior author work. This is the standard case of a methods paper whose results are independently falsifiable via the described experiments.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. The method implicitly assumes standard phase-diversity intensity measurements suffice as input and that the wavefront can be represented by a finite set of modes.

pith-pipeline@v0.9.1-grok · 5807 in / 1218 out tokens · 20195 ms · 2026-06-25T20:03:40.138912+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

12 extracted references · 1 linked inside Pith

[1]

The tar- get wavefront distortions were normalized by a factor of 3λ, yielding a value range of approximately[−3,3], which is con- ducive to stable training of the neural network

Data preprocessing and augmentation Each distribution of the electric field amplitude FF i was independently normalized by its maximum value. The tar- get wavefront distortions were normalized by a factor of 3λ, yielding a value range of approximately[−3,3], which is con- ducive to stable training of the neural network. During infer- ence, the inverse sca...
[2]

The dataset was dynamically updated during the optimization process

Circular buffer The synthetic dataset used for model training was not static. The dataset was dynamically updated during the optimization process. The data were stored in a fixed-size circular buffer, where older examples were continuously replaced with newly generated ones50. The training procedure followed a standard pipeline. Mini-batches were sampled ...
[3]

The DM design and electrode geometry are illustrated in Fig

Experimental facility The experiments were performed using the AOS 52–54 con- sisting a of Shack-Hartmann WFS and a bimorph DM. The DM design and electrode geometry are illustrated in Fig. 7. The DM is a three-layer composite structure consisting of a polished substrate with a reflective coating and two piezoce- ramic disks. All mirror components are rigi...
[4]

The WFS measures the wavefront distortions, defined as the wavefront difference between the incident beam and the reference wavefront
[5]

The control system calculates the corresponding com- mand voltages based on the WFS measurements and applies them to the DM
[6]

The DM updates its surface profile in response to the applied control signals
[7]

Through this feedback loop, the AOS minimizes the resid- ual wavefront distortions between the incident and reference wavefronts

This cycle repeats continuously to ensure a dynamic wavefront stabilization. Through this feedback loop, the AOS minimizes the resid- ual wavefront distortions between the incident and reference wavefronts. The experiment was performed at the PEARL laser facility9. The optical scheme is depicted in Fig. 8. The output from a laser diode was expanded by a b...
[8]

Utilizing the deformable mirror allows for a highly flex- ible phase diversity scheme. For instance, diverse types and amplitudes of wavefront distortions can be applied, enabling the acquisition of intensity distributions at an arbitrary number of focal and defocused planes. The reconstruction accuracy is then primarily limited by the spatial fidelity of...
[9]

Con- ventional methods, such as introducing an auxiliary focusing channel or a beam splitter, inherently in- duce inter-channel wavefront distortions

No supplementary optical components are required in the measurement scheme, avoiding parasitic wavefront distortions and simplifying the optical alignment. Con- ventional methods, such as introducing an auxiliary focusing channel or a beam splitter, inherently in- duce inter-channel wavefront distortions. Alternatively, translating the camera along the op...
[10]

In each experiment, we introduced controlled wavefront distortions into the beam by adding combinations of Zernike modes to the wavefront

This approach seamlessly integrates into the existing AOS architecture, as it can be executed purely as a soft- ware procedure within the closed-loop control system. In each experiment, we introduced controlled wavefront distortions into the beam by adding combinations of Zernike modes to the wavefront. Zernike mode coefficients were gen- erated similar t...
[11]

Phase retrieval algorithm for jwst flight and testbed telescope,

Experimental results The performance of the hybrid method was evaluated in nine experimental series, each with different initial wavefront distortions. Each series comprised nine correction iterations based on the predictions of the hybrid method. In applica- tions aimed at maximizing the peak intensity, the most stan- dard metric for assessing the compen...

2026
[12]

Hubble space telescope characterized by using phase-retrieval algorithms,

pp. 314–330. 2J. R. Fienup, J. C. Marron, T. J. Schulz, and J. H. Seldin, “Hubble space telescope characterized by using phase-retrieval algorithms,” Applied op- tics32, 1747–1767 (1993). Hybrid deep learning-based phase diversity method for wavefront reconstruction 12 3Y . Jin, J. Chen, C. Wu, Z. Chen, X. Zhang, H.-l. Shen, W. Gong, and K. Si, “Wavefront...

Pith/arXiv arXiv 1993

[1] [1]

The tar- get wavefront distortions were normalized by a factor of 3λ, yielding a value range of approximately[−3,3], which is con- ducive to stable training of the neural network

Data preprocessing and augmentation Each distribution of the electric field amplitude FF i was independently normalized by its maximum value. The tar- get wavefront distortions were normalized by a factor of 3λ, yielding a value range of approximately[−3,3], which is con- ducive to stable training of the neural network. During infer- ence, the inverse sca...

[2] [2]

The dataset was dynamically updated during the optimization process

Circular buffer The synthetic dataset used for model training was not static. The dataset was dynamically updated during the optimization process. The data were stored in a fixed-size circular buffer, where older examples were continuously replaced with newly generated ones50. The training procedure followed a standard pipeline. Mini-batches were sampled ...

[3] [3]

The DM design and electrode geometry are illustrated in Fig

Experimental facility The experiments were performed using the AOS 52–54 con- sisting a of Shack-Hartmann WFS and a bimorph DM. The DM design and electrode geometry are illustrated in Fig. 7. The DM is a three-layer composite structure consisting of a polished substrate with a reflective coating and two piezoce- ramic disks. All mirror components are rigi...

[4] [4]

The WFS measures the wavefront distortions, defined as the wavefront difference between the incident beam and the reference wavefront

[5] [5]

The control system calculates the corresponding com- mand voltages based on the WFS measurements and applies them to the DM

[6] [6]

The DM updates its surface profile in response to the applied control signals

[7] [7]

Through this feedback loop, the AOS minimizes the resid- ual wavefront distortions between the incident and reference wavefronts

This cycle repeats continuously to ensure a dynamic wavefront stabilization. Through this feedback loop, the AOS minimizes the resid- ual wavefront distortions between the incident and reference wavefronts. The experiment was performed at the PEARL laser facility9. The optical scheme is depicted in Fig. 8. The output from a laser diode was expanded by a b...

[8] [8]

Utilizing the deformable mirror allows for a highly flex- ible phase diversity scheme. For instance, diverse types and amplitudes of wavefront distortions can be applied, enabling the acquisition of intensity distributions at an arbitrary number of focal and defocused planes. The reconstruction accuracy is then primarily limited by the spatial fidelity of...

[9] [9]

Con- ventional methods, such as introducing an auxiliary focusing channel or a beam splitter, inherently in- duce inter-channel wavefront distortions

No supplementary optical components are required in the measurement scheme, avoiding parasitic wavefront distortions and simplifying the optical alignment. Con- ventional methods, such as introducing an auxiliary focusing channel or a beam splitter, inherently in- duce inter-channel wavefront distortions. Alternatively, translating the camera along the op...

[10] [10]

In each experiment, we introduced controlled wavefront distortions into the beam by adding combinations of Zernike modes to the wavefront

This approach seamlessly integrates into the existing AOS architecture, as it can be executed purely as a soft- ware procedure within the closed-loop control system. In each experiment, we introduced controlled wavefront distortions into the beam by adding combinations of Zernike modes to the wavefront. Zernike mode coefficients were gen- erated similar t...

[11] [11]

Phase retrieval algorithm for jwst flight and testbed telescope,

Experimental results The performance of the hybrid method was evaluated in nine experimental series, each with different initial wavefront distortions. Each series comprised nine correction iterations based on the predictions of the hybrid method. In applica- tions aimed at maximizing the peak intensity, the most stan- dard metric for assessing the compen...

2026

[12] [12]

Hubble space telescope characterized by using phase-retrieval algorithms,

pp. 314–330. 2J. R. Fienup, J. C. Marron, T. J. Schulz, and J. H. Seldin, “Hubble space telescope characterized by using phase-retrieval algorithms,” Applied op- tics32, 1747–1767 (1993). Hybrid deep learning-based phase diversity method for wavefront reconstruction 12 3Y . Jin, J. Chen, C. Wu, Z. Chen, X. Zhang, H.-l. Shen, W. Gong, and K. Si, “Wavefront...

Pith/arXiv arXiv 1993