pith. sign in

arxiv: 2604.08390 · v1 · submitted 2026-04-09 · ⚛️ physics.optics

Closing the Loop in Epitaxy with Machine Learning: Joint Optimization of Growth and Geometry in On-Chip Lasers

Pith reviewed 2026-05-10 17:45 UTC · model grok-4.3

classification ⚛️ physics.optics
keywords machine learningepitaxial growthmicroring lasersBayesian optimizationvariational autoencodersdevice yieldthreshold variancephotonic integrated circuits
0
0 comments X

The pith

Machine learning jointly optimizes epitaxial growth and microring geometry to reach 100 percent lasing yield and cut threshold variance by 73 percent.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that targeting both average performance and consistency across devices through machine learning can overcome the reproducibility limits that hinder scalable photonic circuits. It first applies multi-objective Bayesian optimization to tune growth conditions and laser geometry together, then uses variational autoencoders to trace leftover variations back to specific morphological features. This produces complete yield in fabricated multi-quantum-well microring lasers with a median threshold of 16 microjoules per square centimeter per pulse and emission at 1333 nanometers. A sympathetic reader would care because the method turns uncontrolled process scatter into a diagnosable and reducible factor, moving epitaxy from trial-and-error toward reliable manufacturing loops.

Core claim

By explicitly optimizing for both low lasing threshold and low device-to-device variance in growth and geometry parameters, the workflow achieves 100 percent lasing yield across all designs, a median threshold of 16 μJ cm^{-2} pulse^{-1}, a 73 percent reduction in threshold variance relative to prior best values, and a median emission wavelength of 1333 nm in the O-band; variational autoencoders further isolate the contributions of geometric versus material disorder to the remaining performance spread.

What carries the argument

Multi-objective Bayesian optimization that jointly tunes epitaxial growth parameters and microring geometry, followed by variational autoencoder analysis that attributes residual threshold fluctuations to measurable morphological variations.

If this is right

  • All tested designs lase, removing the usual post-fabrication selection step for these lasers.
  • Threshold variance falls 73 percent while the median threshold reaches 16 μJ cm^{-2} pulse^{-1}.
  • Emission centers at 1333 nm, placing the devices in the telecommunications O-band.
  • Variational autoencoder decomposition separates geometric shape effects from material disorder effects on population-level performance.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same joint-optimization loop could be applied to other bottom-up devices where epitaxial scatter limits yield, such as LEDs or solar cells.
  • Identifying which morphological features drive variance supplies concrete targets for tighter in-situ growth monitoring or feedback control.
  • If the VAE-identified features prove controllable, future runs could test whether further variance reduction is possible without changing the optimization targets.
  • Closing the design-growth-performance loop in this way reduces reliance on exhaustive empirical calibration for each new device architecture.

Load-bearing premise

The optimization and autoencoder models correctly identify and reduce the actual sources of growth and fabrication variability without being misled by limited sampling or model assumptions.

What would settle it

Running the same growth and geometry parameters on a fresh batch of devices and finding that the lasing yield drops below 100 percent or the threshold variance does not drop by approximately 73 percent would show the optimization failed to control the dominant variability sources.

Figures

Figures reproduced from arXiv: 2604.08390 by Andre KY Low, Hark Hoe Tan, Kedar Hippalgaonkar, Mihir R. Athavale, Patrick Parkinson, Stephen A. Church, Wei Wen Wong.

Figure 1
Figure 1. Figure 1: Multi-objective Bayesian optimization improves laser performance, field-level uni [PITH_FULL_IMAGE:figures/full_fig_p007_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Latent space representations isolate and quantify subtle morphological disorder [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Morphological variations directly drive performance discrepancies in nominally [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Decomposing the drivers of lasing threshold predictability. (a) Breakdown of cross [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗
read the original abstract

Achieving device-to-device reproducibility is a critical bottleneck for scalable photonic integrated circuits, as subtle variations in bottom-up epitaxial growth and fabrication severely limit yield. We present a machine learning workflow for III-V multi-quantum well microring lasers that first optimizes growth and geometry parameters via multi-objective Bayesian optimization, then leverages variational autoencoders (VAEs) to attribute residual device-to-device variability to its underlying sources. By explicitly targeting threshold variance alongside absolute performance, we demonstrate 100% lasing yield across all designs. The optimized multi-quantum well microring laser fields achieved a median lasing threshold of $16~\mu\mathrm{J}\,\mathrm{cm}^{-2}\,\mathrm{pulse}^{-1}$, a $73\%$ reduction in threshold variance relative to the previously reported best values, and a median emission wavelength of $1333~\mathrm{nm}$, in the telecommunications O-band. Furthermore, to diagnose residual performance dispersion under nominally identical conditions, VAEs were used to isolate the key components of device morphology that impact performance. This analysis successfully decoupled geometric from material disorder, quantitatively linking previously unmeasured morphological variations to population-level threshold fluctuations. This data-driven workflow bridges the gap between fundamental epitaxy and reliable manufacturing, establishing a generalizable blueprint for designing and yield-optimizing complex, non-linear optoelectronic devices.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript presents a machine learning workflow for III-V multi-quantum well microring lasers that combines multi-objective Bayesian optimization of growth and geometry parameters (jointly targeting lasing threshold and its variance) with variational autoencoder (VAE) analysis to attribute residual device-to-device variability to geometric versus material sources. The optimized devices are reported to achieve 100% lasing yield, a median threshold of 16 μJ cm^{-2} pulse^{-1}, a 73% reduction in threshold variance relative to prior best values, and a median emission wavelength of 1333 nm in the O-band.

Significance. If the reported experimental outcomes hold under rigorous validation, the work would be significant for scalable photonic integrated circuits by demonstrating a closed-loop, data-driven approach to mitigate variability in epitaxial growth and fabrication. Explicitly optimizing for threshold variance alongside performance, combined with VAE-based decomposition of morphology effects, offers a generalizable blueprint that could improve yield in non-linear optoelectronic devices.

major comments (2)
  1. [Methods] Methods section (optimization workflow): the multi-objective Bayesian optimization is presented without details on the size of the sampled parameter space, number of experimental iterations or feedback loops executed, choice of acquisition function, baseline comparisons (e.g., against grid search or single-objective BO), or statistical tests with error bars for the 100% yield and 73% variance reduction claims. These omissions are load-bearing because the central experimental results depend on demonstrating that the improvements are not due to insufficient sampling or model-induced biases.
  2. [VAE Analysis] VAE analysis section: the decomposition of residual variability into geometric and material components lacks quantitative cross-validation against independent metrology (e.g., AFM or TEM measurements) or physical simulations, making it unclear whether the attributions are robust or potentially artifacts of the latent space. This directly affects the claim that the workflow successfully decouples sources of threshold fluctuations.
minor comments (2)
  1. [Abstract and Results] The abstract and results sections would benefit from explicit statements of the number of devices tested per design and any data exclusion criteria to support the population-level statistics.
  2. [Results] Notation for units (e.g., μJ cm^{-2} pulse^{-1}) is clear but could be standardized in a table of all reported metrics for easier comparison with prior literature.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive review and for highlighting the potential impact of our closed-loop optimization workflow. We address each major comment below and will revise the manuscript to incorporate additional details and clarifications where possible.

read point-by-point responses
  1. Referee: [Methods] Methods section (optimization workflow): the multi-objective Bayesian optimization is presented without details on the size of the sampled parameter space, number of experimental iterations or feedback loops executed, choice of acquisition function, baseline comparisons (e.g., against grid search or single-objective BO), or statistical tests with error bars for the 100% yield and 73% variance reduction claims. These omissions are load-bearing because the central experimental results depend on demonstrating that the improvements are not due to insufficient sampling or model-induced biases.

    Authors: We agree that expanded details on the Bayesian optimization procedure are necessary for reproducibility and to substantiate the central claims. In the revised manuscript, we will augment the Methods section with the dimensionality and bounds of the explored parameter space, the total number of experimental iterations and feedback loops executed, the acquisition function (expected hypervolume improvement), and available baseline comparisons. We will also add statistical tests, error bars, and confidence intervals for the 100% yield and 73% variance reduction. These additions will demonstrate that the reported improvements are not artifacts of limited sampling. revision: yes

  2. Referee: [VAE Analysis] VAE analysis section: the decomposition of residual variability into geometric and material components lacks quantitative cross-validation against independent metrology (e.g., AFM or TEM measurements) or physical simulations, making it unclear whether the attributions are robust or potentially artifacts of the latent space. This directly affects the claim that the workflow successfully decouples sources of threshold fluctuations.

    Authors: We acknowledge that stronger validation of the VAE attributions would increase confidence in the geometric versus material decomposition. In revision, we will add sensitivity analyses, reconstruction error metrics, and qualitative comparisons to available SEM and optical data to support the robustness of the latent-space attributions. However, a full quantitative cross-validation against AFM/TEM metrology or new physical simulations was not performed in this study and would require additional experiments beyond the current scope; we will therefore note this as a limitation while emphasizing the correlation between VAE-derived features and observed threshold fluctuations. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper presents an experimental ML workflow: multi-objective Bayesian optimization jointly targets measured lasing threshold and its variance across growth/geometry parameters, followed by VAE-based post-fabrication decomposition of observed device-to-device variability into geometric versus material components. Reported outcomes (100% yield, median threshold of 16 μJ cm^{-2} pulse^{-1}, 73% variance reduction, 1333 nm median wavelength) are framed as direct experimental measurements on fabricated devices, not as model predictions or quantities derived by construction from fitted inputs. No equations, self-citations, or ansatzes reduce the central claims to tautological re-statements of the optimization targets or prior data; the derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. Inferred elements are standard ML assumptions common to the described techniques.

axioms (1)
  • domain assumption Bayesian optimization with Gaussian processes can efficiently optimize multi-objective functions over epitaxial and geometric parameters
    Implicit in the use of multi-objective Bayesian optimization for growth and geometry tuning.

pith-pipeline@v0.9.0 · 5562 in / 1361 out tokens · 80243 ms · 2026-05-10T17:45:44.570849+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

2 extracted references · 2 canonical work pages

  1. [1]

    Quantum-dot-based deter- ministic photon–emitter interfaces for scalable photonic quantum technology.Nature Nanotechnology2021,16, 1308–1317

    (4) Uppu, R.; Midolo, L.; Zhou, X.; Carolan, J.; Lodahl, P. Quantum-dot-based deter- ministic photon–emitter interfaces for scalable photonic quantum technology.Nature Nanotechnology2021,16, 1308–1317. (5) Huang, D.; Xuan, Z.; Kumar, R.; Levy, C.; Su, G.; Ma, C.; Wu, X.; Liu, S.; Sharma, J.; Kim, J.; Acikalin, T.; Balamuragan, G.; Jaussi, J.; Rong, H. A s...

  2. [2]

    The OpenCV Library.Dr

    (38) Bradski, G. The OpenCV Library.Dr. Dobb’s Journal of Software Tools2000, (39) Zuiderveld, K. InGraphics Gems; Heckbert, P. S., Ed.; Academic Press, 1994; pp 474–485. (40) Kingma, D. P.; Welling, M. Auto-Encoding Variational Bayes.2nd International Con- ference on Learning Representations, ICLR 2014 - Conference Track Proceedings2022, (41) Paszke, A. ...