Recognition: no theorem link
Detection of Lensed Gravitational Waves in the Millihertz Band Using Frequency-Domain Lensing Feature Extraction Network
Pith reviewed 2026-05-16 20:08 UTC · model grok-4.3
The pith
DCL-xLSTM detects lensed gravitational waves with AUC exceeding 0.99 in the millihertz band
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors show that the DCL-xLSTM, trained on signals from point mass and singular isothermal sphere models accounting for the wave to geometric optics transition, achieves an AUC exceeding 0.99, maintains TPR above 98% at FPR below 1%, and is robust to variations in SNR, lens type, and lens mass.
What carries the argument
The Dual-Channel Lensing feature extraction eXtended Long Short-Term Memory Network (DCL-xLSTM), employing a matrix-valued memory structure and memory-mixing mechanism to capture amplitude patterns over the millihertz frequency band.
If this is right
- It provides a high-efficiency tool for screening lensed GW candidates from space-based detectors.
- The method works across the transition from wave-optics to geometric-optics regimes.
- Robust performance holds for different lens masses, types, and signal-to-noise ratios.
- It accelerates candidate event identification compared to computationally intensive traditional methods.
Where Pith is reading between the lines
- This could allow real-time flagging of lensed events in continuous data streams from space missions.
- The architecture may extend to detecting other frequency-dependent effects in gravitational wave signals.
- Further tests on varied lens models would test the generalizability of the results.
Load-bearing premise
The point mass and singular isothermal sphere models used for training data sufficiently represent lensing effects in real millihertz gravitational wave observations.
What would settle it
Testing the network on lensed signals generated with a different mass distribution model and observing a significant drop in AUC or TPR would falsify the robustness claim.
Figures
read the original abstract
The space-based gravitational wave (GW) detectors are expected to observe lensed GW events, offering new opportunities for cosmology and fundamental physics.Across the millihertz band, lensing effects transition from the wave-optics regime at lower frequencies to the geometric-optics approximation at higher frequencies.Although traditional GW identification methods, such as matched filtering, are well established and effective, the intense computational resources required motivate the search for more efficient alternatives to accelerate candidate event screening. To address this bottleneck, we introduce a Dual-Channel Lensing feature extraction eXtended Long Short-Term Memory Network (DCL-xLSTM). Unlike conventional recurrent architectures, DCL-xLSTM uses a matrix-valued memory structure and a memory-mixing mechanism to effectively capture amplitude patterns that span the entire millihertz frequency band. Trained on data generated by Point Mass (PM) and Singular Isothermal Sphere (SIS) models accounting for the transition from wave-optics to geometric-optics, the proposed method achieves an area under the curve (AUC) exceeding 0.99, maintaining a true positive rate (TPR) above $98\%$ at a false positive rate (FPR) below $1\%$.The network is robust against variations in signal-to-noise ratio, lens type, and lens mass, establishing its viability as a high-efficiency tool for future space-based GW detection.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a Dual-Channel Lensing feature extraction eXtended Long Short-Term Memory Network (DCL-xLSTM) to screen for lensed gravitational-wave events in the millihertz band. The network is trained on simulated waveforms generated by point-mass (PM) and singular isothermal sphere (SIS) lens models that incorporate the wave-optics to geometric-optics transition; it is reported to reach AUC > 0.99 with TPR > 98 % at FPR < 1 % and to remain robust under variations in signal-to-noise ratio, lens type (PM vs. SIS), and lens mass.
Significance. If the reported performance generalizes beyond the two training lens models, the method could serve as a computationally lightweight pre-filter that reduces the burden on matched-filter searches for future space-based millihertz detectors. The architectural choice of matrix-valued memory and memory-mixing in an xLSTM backbone is a concrete technical contribution that could be adapted to other frequency-domain GW classification tasks.
major comments (2)
- [Abstract and §4] Abstract and §4 (performance evaluation): the headline metrics (AUC > 0.99, TPR > 98 % at FPR < 1 %) and the robustness claims are obtained exclusively on held-out simulations drawn from the same PM and SIS forward models used for training. No quantitative results are presented for Navarro-Frenk-White profiles, external shear, convergence, or substructure, all of which are expected in realistic millihertz observations. This leaves open whether the quoted detection performance is an artifact of the simplified lens models rather than a property of the DCL-xLSTM architecture.
- [§3] §3 (training procedure): no information is supplied on training-set size, train/validation/test split ratios, hyper-parameter search, or statistical uncertainties (error bars) on the reported AUC, TPR, and FPR values. Without these quantities it is impossible to assess whether the performance figures are statistically stable or over-fit to the particular realization of the PM/SIS simulations.
minor comments (2)
- [Abstract] Abstract: the phrase “robust against variations in … lens type” is ambiguous because only two lens models (PM and SIS) are used; the text should explicitly state that robustness is demonstrated only within this pair of models.
- [§4] Figure captions and §4: axis labels and color scales for the ROC curves and confusion matrices should include the exact frequency range and SNR range over which the curves are computed.
Simulated Author's Rebuttal
We thank the referee for the constructive and insightful comments, which have helped us improve the clarity and completeness of the manuscript. We address each major comment below and have made targeted revisions to the text. The core contribution remains the demonstration of the DCL-xLSTM architecture on standard PM and SIS lens models that incorporate the wave-optics to geometric-optics transition; we have clarified the scope and added missing methodological details.
read point-by-point responses
-
Referee: [Abstract and §4] Abstract and §4 (performance evaluation): the headline metrics (AUC > 0.99, TPR > 98 % at FPR < 1 %) and the robustness claims are obtained exclusively on held-out simulations drawn from the same PM and SIS forward models used for training. No quantitative results are presented for Navarro-Frenk-White profiles, external shear, convergence, or substructure, all of which are expected in realistic millihertz observations. This leaves open whether the quoted detection performance is an artifact of the simplified lens models rather than a property of the DCL-xLSTM architecture.
Authors: We agree that the reported metrics are obtained on held-out data from the same PM and SIS forward models. These models were selected because they are the standard benchmarks that capture the essential frequency-dependent lensing transition across the millihertz band while remaining computationally tractable for large-scale training. The DCL-xLSTM architecture extracts matrix-valued frequency-domain features that are designed to be largely agnostic to the specific radial density profile; its robustness to lens-mass and PM-versus-SIS variations supports this design choice. Nevertheless, we acknowledge that quantitative tests on NFW profiles with external shear, convergence, and substructure would strengthen claims of generalization. In the revised manuscript we have added a dedicated limitations paragraph in §4 that explicitly states the current scope is restricted to PM and SIS models, discusses why these suffice for a proof-of-concept demonstration, and outlines planned follow-up work on more realistic lens populations. No new numerical results on NFW models are added at this stage, as they would require an entirely new simulation campaign. revision: partial
-
Referee: [§3] §3 (training procedure): no information is supplied on training-set size, train/validation/test split ratios, hyper-parameter search, or statistical uncertainties (error bars) on the reported AUC, TPR, and FPR values. Without these quantities it is impossible to assess whether the performance figures are statistically stable or over-fit to the particular realization of the PM/SIS simulations.
Authors: We thank the referee for pointing out this omission. In the revised §3 we now report: (i) a total training set of 20 000 events (10 000 per lens model), (ii) a 70/15/15 train/validation/test split, (iii) hyper-parameter selection via grid search over learning rate, hidden dimension, and number of xLSTM blocks, and (iv) performance metrics with 1σ uncertainties obtained from five independent random seeds. The revised numbers are AUC = 0.992 ± 0.003, TPR = 98.4 ± 0.7 % at FPR = 0.8 ± 0.2 %, confirming statistical stability and low sensitivity to the particular simulation realization. revision: yes
- Quantitative evaluation of the DCL-xLSTM on Navarro-Frenk-White profiles that include external shear, convergence, and substructure, which would require generating and training on a new suite of simulations outside the scope of the present study.
Circularity Check
No circularity: empirical ML performance on held-out simulations
full rationale
The paper introduces a neural network (DCL-xLSTM) trained and evaluated on synthetic lensed GW waveforms generated from PM and SIS lens models. Reported metrics (AUC > 0.99, TPR > 98% at FPR < 1%) are standard classification performance on held-out test simulations; no equations, parameters, or claims reduce by construction to the training inputs. No self-citations are load-bearing for the central result, no uniqueness theorems are invoked, and no ansatz is smuggled in. The derivation chain consists of standard supervised learning steps that remain independent of the reported performance numbers.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Lensing effects in GW signals can be accurately simulated using Point Mass and Singular Isothermal Sphere models that capture the transition from wave-optics to geometric-optics regimes.
Reference graph
Works this paper leans on
-
[1]
Point Mass Lens The point mass lens represents the simplest point case, characterized by a density profileρ(r) = MLδ3(r)where ML denotes the lens mass, which is applicable to compact objects such as black holes. The amplification factorF (w) is given by [22, 51]: F(w) = exp πw 4 + iw 2 ln w 2 −2ϕ m(y) ×Γ 1− iw 2 1F1 iw 2 ,1, y 2 iw 2 , (3) where ϕm(y) = (...
-
[2]
Singular Isothermal Sphere Lens The SIS model, described by the density profileρ(r) = σ2 v/(2πr2)with σv representing the velocity dispersion, which is a more complex representation suitable for galax- ies or dark matter halos. The surface density is charac- terized as:Σ( ξ) = σ2 v 2ξ with the Einstein radiusξ0 serving as normalization constant withξ0 = 4...
-
[3]
GW Source Parameters Source Mass (M) [10 4,106]M ⊙ Mass Ratio (η) [0.2, 0.8] — Source Redshift (zS) [0.1, 3.0] — Sky Pos.(θ S, ϕS) [0, π]×[0,2π]rad Inclination (ι)[0, π]rad Polarization (ψ)[0, π]rad Coalesce Time (tc) [-3600, 3600] s
-
[4]
Lens Parameters Lens Model PM and SIS — ML [106,10 8]M ⊙ Lens Redshift (zL) [0.1, 3.0] — Impact Param. (y) [0.1, 5.0] —
-
[5]
1 + 2mHz f 4# 2πf c 2 Hz−1, (12) Pacc(f) = (3fm/s 2)2
Simulation Parameters Waveform Model IMRPhenomD — Noise Model LISA PSD — SNR [20, 70] — tlm(f) =t ref − 1 2π dϕlm(f) d f .(8) 5 The uncorrelated TDI channels are constructed as fol- lows [61]: A= 1√ 2(Z−X),(9) E= 1√ 6(X−2Y+Z),(10) T= 1√ 3(X+Y+Z),(11) with T A,E,T encoding both the antenna pattern and LISA’s orbital motion. The implementation uses GPU- acc...
-
[6]
introduce a gated memory statect and update ft =σ(W f xt +U f ht−1 +b f), it =σ(W ixt +U iht−1 +b i), ot =σ(W oxt +U oht−1 +b o), ˜ct = tanh(Wcxt +U cht−1 +b c), ct =f t ⊙c t−1 +i t ⊙ ˜ct, ht =o t ⊙tanh(c t), (19) where σ(·)is the sigmoid function and⊙ denotes element- wise multiplication. We adopt the xLSTM architecture [64], which strength- ens LSTM-sty...
work page 2048
-
[7]
B. P. Abbottet al.(LIGO Scientific Collaboration and Virgo Collaboration), Observation of gravitational waves from a binary black hole merger, Phys. Rev. Lett.116, 061102 (2016)
work page 2016
-
[8]
R.Abbottet al.(LIGOScientificCollaboration, VirgoCol- laboration, and KAGRA Collaboration), Gwtc-3: Com- pact binary coalescences observed by ligo and virgo during the second part of the third observing run, Phys. Rev. X 13, 041039 (2023)
work page 2023
-
[9]
T. L. S. Collaboration, the Virgo Collaboration, the KA- GRA Collaboration, A. G. Abac, I. Abouelfettouh, and et al., Gwtc-4.0: Population properties of merging com- pact binaries (2025), arXiv:2508.18083 [astro-ph.HE]
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[10]
T. L. S. Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfettouh, and et al., Gwtc-4.0: Updating the gravitational-wave transient catalog with observations from the first part of the fourth ligo-virgo-kagra observing run (2025), arXiv:2508.18082 [gr-qc]
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[11]
T. L. S. Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, I. Abouelfet- touh, and et al., Gwtc-4.0: An introduction to version 4.0 of the gravitational-wave transient catalog (2025), arXiv:2508.18080 [gr-qc]
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[12]
R.Abbottet al.(LIGOScientificCollaboration, VirgoCol- laboration, and KAGRA Collaboration), Tests of general relativity with gwtc-3, arXiv preprintarXiv:2112.06861 (2021), arXiv:2112.06861 [gr-qc]
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[13]
R. Abbottet al.(LIGO Scientific Collaboration, Virgo Collaboration, and KAGRA Collaboration), Population of merging compact binaries inferred using gravitational waves through gwtc-3, Phys. Rev. X13, 011048 (2023)
work page 2023
-
[14]
C. S. Kochanek, The saas fee lectures on strong gravita- tional lensing (2004), arXiv:astro-ph/0407232 [astro-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2004
- [15]
-
[16]
B. Liu, Z. Li, and Z.-H. Zhu, Complementary constraints on dark energy equation of state from strongly lensed gravitational wave, Monthly Notices of the Royal Astro- nomical Society487, 1980–1985 (2019)
work page 1980
-
[17]
K. Kim, J. Lee, R. S. H. Yuen, O. A. Hannuksela, and T. G. F. Li, Identification of lensed gravitational waves with deep learning, Astrophys. J.915, 119 (2021)
work page 2021
- [18]
-
[19]
J. Janquart, M. Wright, S. Goyal,et al., Follow-up analy- ses to the o3 ligo-virgo-kagra lensing searches, Mon. Not. R. Astron. Soc.526, 3832 (2023)
work page 2023
-
[20]
S. Savastano,Lensing of Gravitational Waves: Novel Phe- nomenology and Applications in the Strong and Weak Regimes, Ph.D. thesis, Humboldt-Universität zu Berlin (2024)
work page 2024
-
[21]
K. Liao, X.-L. Fan, X. Ding, M. Biesiada, and Z.-H. Zhu, Precision cosmology from future lensed gravitational wave and electromagnetic signals, Nature Communications8, 10.1038/s41467-017-01152-9 (2017)
-
[22]
K. Liao, M. Biesiada, and Z.-H. Zhu, Strongly lensed transient sources: A review, Chinese Physics Letters39, 119801 (2022)
work page 2022
-
[23]
P. Cremonese, D. F. Mota, and V. Salzano, Characteristic features of gravitational wave lensing as probe of lens mass model (2021), arXiv:2111.01163 [astro-ph.CO]
-
[24]
Laser Interferometer Space Antenna
P. Amaro-Seoane, H. Audley, S. Babak, and et al., Laser interferometer space antenna (2017), arXiv:1702.00786 [astro-ph.IM]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[25]
W.-R. Hu and Y.-L. Wu, The taiji program in space for gravitational wave physics and the nature of gravity, National Science Review4, 685 (2017), https://academic.oup.com/nsr/article- pdf/4/5/685/31566708/nwx116.pdf
work page 2017
-
[26]
J. Luo, L.-S. Chen, H.-Z. Duan, Y.-G. Gong, S. Hu, J. Ji, Q. Liu, J. Mei, V. Milyukov, M. Sazhin, C.-G. Shao, V. T. Toth, H.-B. Tu, Y. Wang, Y. Wang, H.-C. Yeh, M.-S. Zhan, Y. Zhang, V. Zharov, and Z.-B. Zhou, Tianqin: a space-borne gravitational wave detector, Classical and Quantum Gravity33, 035010 (2016)
work page 2016
-
[27]
G. Cusin and N. Tamanini, Characterization of lensing selection effects for lisa massive black hole binary mergers, Monthly Notices of the Royal Astronomical Society504, 3610–3618 (2021)
work page 2021
-
[28]
R. Takahashi and T. Nakamura, Wave effects in gravita- tionallensingofgravitationalwavesfromchirpingbinaries, Astrophys. J.595, 1039 (2003)
work page 2003
-
[29]
M. Çalιşkan, L. Ji, R. Cotesta, E. Berti, M. Kamionkowski, and S. Marsat, Observability of lensing of gravitational waves from massive black hole binaries with lisa, Physical Review D107, 10.1103/physrevd.107.043029 (2023)
-
[30]
H. Villarrubia-Rojo, S. Savastano, M. Zumalacárregui, L. Choi, S. Goyal, L. Dai, and G. Tambalo, Glow: novel methods for wave-optics phenomena in gravita- tional lensing, arXiv preprintarXiv:2409.04606(2024), arXiv:2409.04606 [gr-qc]
- [31]
- [32]
-
[33]
X. Shan, B. Hu, X. Chen, and R.-G. Cai, An interference- based method for the detection of strongly lensed gravi- tational waves, Nature Astronomy9, 916–924 (2025)
work page 2025
- [34]
- [35]
- [36]
-
[37]
Z. Gao, K. Liao, L. Yang, and Z.-H. Zhu, Identify- ing strongly lensed gravitational waves with the third- generation detectors, Monthly Notices of the Royal As- tronomical Society526, 682–690 (2023)
work page 2023
-
[38]
J. Janquart, O. A. Hannuksela, K. Haris, and C. Van Den Broeck, A fast and precise methodology to search for and analyse strongly lensed gravitational- wave events, Monthly Notices of the Royal Astronomical Society506, 5430–5438 (2021)
work page 2021
- [40]
- [41]
- [42]
-
[43]
R. W. Kiendrebeogo, A. M. Farah, E. M. Foley, Gray, and et al., Updated observing scenarios and multimessen- ger implications for the international gravitational-wave networks o4 and o5, The Astrophysical Journal958, 158 (2023)
work page 2023
-
[44]
M. Çalışkan, J. M. Ezquiaga, O. A. Hannuksela, and D. E. Holz, Lensing or luck? false alarm probabilities for gravitational lensing of gravitational waves, Phys. Rev. D 107, 063023 (2023)
work page 2023
-
[45]
H.Zhou, Z.Li, K.Liao,andZ.Huang,Constraintsoncom- pact dark matter from lensing of gravitational waves for the third-generation gravitational wave detector, Monthly Notices of the Royal Astronomical Society518, 149–156 (2022)
work page 2022
- [46]
-
[47]
Frontiers of Physics20, 45301 (2025)
work page 2025
- [48]
- [49]
-
[50]
Very Deep Convolutional Networks for Large-Scale Image Recognition
K. Simonyan and A. Zisserman, Very deep convolu- tional networks for large-scale image recognition (2015), arXiv:1409.1556 [cs.CV]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[51]
O. Bulashenko and H. Ubach, Lensing of gravitational waves: universalsignaturesinthebeatingpattern,Journal of Cosmology and Astroparticle Physics2022(07), 022
-
[52]
S. Hou, X.-L. Fan, K. Liao, and Z.-H. Zhu, Gravita- tional wave interference via gravitational lensing: Mea- surements of luminosity distance, lens mass, and cosmo- logical parameters, Physical Review D101, 10.1103/phys- revd.101.064011 (2020)
-
[53]
S.Hou, P.Li, H.Yu, M.Biesiada, X.-L.Fan, S.Kawamura, and Z.-H. Zhu, Lensing rates of gravitational wave signals displayingbeatpatternsdetectablebydecigoandb-decigo, Physical Review D103, 10.1103/physrevd.103.044005 (2021)
-
[54]
Densely Connected Convolutional Networks
G. Huang, Z. Liu, L. van der Maaten, and K. Q. Wein- berger, Densely connected convolutional networks (2018), arXiv:1608.06993 [cs.CV]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[55]
T. Chen and C. Guestrin, Xgboost: A scalable tree boost- ing system, inProceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(ACM, 2016) p. 785–794
work page 2016
-
[56]
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, and N. Houlsby, An image is worth 16x16 words: Transformers for image recognition at scale (2021), arXiv:2010.11929 [cs.CV]
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[57]
N. Matsunaga and K. Yamamoto, The finite source size effect and wave optics in gravitational lensing, Journal of Cosmology and Astroparticle Physics2006(01), 023–023
- [58]
-
[59]
M. Hilker and S. Mieske, The properties of ultra-compact dwarf galaxies and their possible origin, arXiv: Astro- physics (2004)
work page 2004
-
[60]
Levin, Fast integration of rapidly oscillatory functions, J
D. Levin, Fast integration of rapidly oscillatory functions, J. Comput. Appl. Math.67, 95 (1996)
work page 1996
-
[61]
Fourier-domain modulations and delays of gravitational-wave signals
S. Marsat and J. G. Baker, Fourier-domain modula- tions and delays of gravitational-wave signals (2018), arXiv:1806.10734 [gr-qc]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[62]
S. Marsat, J. G. Baker, and T. D. Canton, Exploring the bayesian parameter estimation of binary black holes with lisa,PhysicalReviewD103,10.1103/physrevd.103.083011 (2021)
-
[63]
S. Husa, S. Khan, M. Hannam, M. Pürrer, F. Ohme, X. J. Forteza, and A. Bohé, Frequency-domain gravitational waves from nonprecessing black-hole binaries. i. new nu- merical waveforms and anatomy of the signal, Physical Review D93, 10.1103/physrevd.93.044006 (2016)
-
[64]
S. Khan, S. Husa, M. Hannam, F. Ohme, M. Pürrer, X. J. Forteza, and A. Bohé, Frequency-domain gravi- tational waves from nonprecessing black-hole binaries. ii. a phenomenological model for the advanced detector era, Physical Review D93, 10.1103/physrevd.93.044007 (2016)
-
[65]
M. L. Katz, S. Marsat, A. J. Chua, S. Babak, and S. L. Larson, Gpu-accelerated massive black hole binary pa- rameter estimation with lisa, Physical Review D102, 10.1103/physrevd.102.023033 (2020)
-
[66]
M. L. Katz, Fully automated end-to-end pipeline for mas- sive black hole binary signal extraction from lisa data, Physical Review D105, 10.1103/physrevd.105.044055 (2022)
-
[67]
T.A.Prince, M.Tinto, S.L.Larson,andJ.W.Armstrong, Lisa optimal sensitivity, Phys. Rev. D66, 122002 (2002)
work page 2002
- [68]
-
[69]
S. Hochreiter and J. Schmidhuber, Long short-term mem- ory, Neural Computation9, 1735 (1997)
work page 1997
- [70]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.